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ABSTRACT 


Neu  and  improved  algorithms  for  computation  in 
several  fundamental  polynomial  operations  are  presented. 
The  common  bases  for  these  algorithms  are 
generalizations  of  the  p-adic  technique  used  in  the 
constructive  proof  of  the  Hensel  Lemma.  Multivariate 
polynomial  operations  are  stressed  due  to  the  special 
importance  of  the  mul tivari ate  Hensel -type  construction 
in  replacing  the  modular  eva I uation-and- interpolation 
technique  under  certain  conditions.  Due  to  the 
availability  of  numerous  (not  completely  satisfactory) 
methods  for  the  computation  of  polynomial  greatest 
common  divisors  (GCD),  the  EZGCD  Algorithm  based  on  the 
Hensel  construction  is  given  special  emphasis.  An 
intuitive  computing  time  analysis  and  many  empirical 
experiments  are  made  to  compare  the  performance  of  this 
algorithm  with  two  other  major  methods,  especially  the 
Modular  GCD  Algorithm.  Both  theoretically  and  by  actual 
computing  data,  the  neu  EZGCD  Algorithm  demonstrates 
promising  efficiencies  by  taking  advantage  of  the 
sparseness  of  multivariate  polynomials.  An  intuitive 
and  more  "engineering"  approach  to  computing  time 
analysis  also  appears  to  give  quite  accurate  predictions 
of  actual  run  times  for  many  practical  problems.  Other 
applications  of  the  Hensel-type  constructions  resulting 
in  improved  algorithms  for  computing  polynomial 
factorizations,  contents  and  primitive  parts,  and 
square-free  decompositions  are  also  described. 
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CHAPTER  I.  INTRODUCTION 


I - 1 Introduction 


Uith  the  development  of  modern  computing  machines,  many 
numerical  algorithms  became  powerful  tools  used  by  an 
increasing  number  of  people.  Technical  terms  such  as 
polynomial  approximation,  evaluation  and  interpolation,  and 
numerical  quadratures  became  common  words  to  almost  anyone 
with  a college  mathematical  background.  A basic  drawback  in 
numerical  methods  lie  in  their  painstaking  concern  for 
numerical  errors,  truncation  and  round-off,  and  the  fact 
that  in  many  cases,  the  resulting  numbers  do  not  provide 
enough  information  for  interpreting  physical  phenomena  or 
gaining  significant  insight.  In  such  cases  these  methods 
have  failed,  since  as  R.  U.  Hamming  said,  "the  purpose  of 
computing  is  insight,  not  numbers". 

* Research  in  the  field  of  symbolic  and  algebraic 
manipulation  was  initiated  over  a decade  age,  based  upon 
this  spirit  as  well  as  the  general  perceived  need.  By 
dealing  mainly  with  exact  numbers,  infinite  precision 
integers  and  rational  numbers,  and  algebraic  expressions  in 
terms  of  their  symbolic  representations,  computers  an  now 
eliminate  some  of  the  sources  of  numerical  error  and  aid 


scientists  in  performing  many  non-numeric  computations. 

Many  software  systems  devoted  to  various  classes  of  symbolic 


computations  have  been  developed  since  1960  or  so.  They 
have  been  used  and  tested  in  many  different  ways,  but  they 
do  not  have  the  uide  acclaim  that  the  numerical  software  rou- 
tines have  achieved.  Various  arguments  can  be  made  to 
explain  this  - some  symbolic  packages  are  highly  specialized 
and  some  systems  are  still  in  embryonic  development  stages. 
There  is,  houever,  one  common  reason  for  the  lack  of  wide- 
spread usage  for  many  of  these  systems:  the  rapid  exhaustion 
of  the  allotted  data  storage  spaces  for  symbolic  expressions 
due  to  the  grouth  of  expressions  during  computations.  Thie 
expression  growth  problem  can  be  divided  into  two 
categories.  One  is  the  blowup  of  the  resulting  expression 
of  a computation.  A good  example  for  that  is  the 
computation  of  the  determinant  of  a matrix.  The  determinant 
of  an  n by  n numerical  matrix  is  still  a number,  but  a 
determinant  of  symbolic  matrix  may  contain  n factorial 
terms.  The  other  category  is  the  blowup  of  intermediate 
expressions  uhen  the  final  result  of  the  computation  ie 
quite  small  and  manageable.  For  this,  the  problem  of 
computing  the  greatest  common  divisor  (6CD)  demonstrates 
this  phenomenon  [KNU69].  The  GCD  of  two  integers  can  be 
easily  computed  by  the  classical  Euclidean  algorithm,  but 
the  similar  method  for  computing  the  GCD  of  two  polynomials 
in  just  one  variable  over  the  integers  can  have  enormous 
intermediate  results  even  though  the  actual  GCD  could  simply 
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be  1 [KNUB9]  CBR071] . Uhen  polynomials  in  several  variables 
are  involved  the  blowup  may  be  even  worse.  Actually,  the 
distinction  betueen  the  two  categories  is  not  as  clear  >.ut 
as  the  above  examples  have  made  it  appear.  Uhen  a 
computation  involves  many  different  steps,  blowup  of  the 
first  kind  in  any  step  can  be  considered  in  the  second 
category,  if  only  the  final  result  of  the  entire  computation 
is  important.  Thus  any  of  these  blowup  problems  will  be 
referred  to  as  the  "expression  growth"  problem. 

The  solution  to  this  common  problem  in  symbolic  manipu- 
lation seems  to  lie  in  careful  analysis  of  existing  computa- 
tional algorithms  and  discoveries  of  more  efficient  new 
methods.  (Although  faster  computers  with  more  memory  will 
also  help,  of  course.)  Much  recent  research  in  symbolic 
manipulation  has  been  directed  toward  algorithm  analysis. 
Perhaps  the  most  important  and  useful  collection  of  papers 
on  algorithms  (and  also  systems)  in  symbolic  manipulation  up 
to  1971  is  the  "Proceedings  of  the  Second  Symposium  on 
Symbol ic  and  Algebraic  tlanipulation"  [PET71J.  Many 
important  surveys,  tutorials,  and  current  research  papers  in 
all  areas  of  the  field  are  presented.  Since  polynomial  and 
rational  function  operations  and  simplification  form  the 
fundamental  basis  of  any  symbolic  manipulation  system,  they 
represent  a large  portion  of  the  papers  on  symbolic 
algorithms.  Among  these  algorithms  a noticeable  common 
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theme  is  the  use  of  modular  arithmetic  and  interpolation 
techiquas.  At  that  time,  tne  common  sentiment  seemed  to  be 
a belief  in  the  general  applicability  of  the  modular  and 
evaluation  mappings  for  many  basic  polynomial  calculations. 

It  is  one  of  the  major  goals  of  this  thesis  to  show  how 
misleading  this  idea  has  been  for  several  classes  of 
problems,  especially  the  computation  of  multivariate 
polynomial  GCD’s. 

Another  major  algorithmic  advance  in  this  fieid  has 
been  in  the  problem  of  polynomial  factorization.  Berlekamp 
[BERG71.  Zassenhaus  IZASG91 , flusser  [MU571] . Uang,  and 
Rothschild  IU8R73I  have  been  instrumental  in  making  this 
advance.  The  essential  innovation  in  accomplishing  the  task 
of  factoring  polynomials  is  another  classical  idea  of  p-adic 
constructions  in  the  Hensel’s  Lemma  [VDUA91 . It  turns  out 
that  this  method  is  not  only  applicable  to  factorization  but 
also  to  many  other  polynomial  operations,  especially  GCD’s. 
Ue  are  especially  interested  in  the  computation  of  GCD’s 
since  there  are  many  known  methods  of  performing  this  task 
and  none  seems  satisfactory  for  a uide  spectrum  of  practical 
problems. 

The  use  of  Hensel-type  constructions  in  polynomial 
factorization  uas  crucial  because  it  is  very  uneconomical  to 
perform  many  factorizations  using  the  Berlekamp  algorithm. 
The  Hensel-type  construction  takes  advantage  of  the  special 
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structure  of  a single  modular  factorization  and  extends  this 
modular  image  to  a full  factorization.  As  some  of  these 
Hensel-type  algorithms  were  being  developed,  dissatisfaction 
with  the  computing  times  of  polynomial  GCD’s  using  Known 
algorithms  was  mounting.  The  modular  GCO  algorithm,  which 
also  uses  modular  and  evaluation  mappings  similar  to  the 
factorization  algorithms,  was  performing  poorly  for  large 
class  of  practical  problems  run  on  the  symbolic  manipulation 
system  MACSYMA  IHAC73] . It  was  upon  doses'  suggestion  that 
the  similarities  between  the  problems  of  factorization  and 
GCD  computation  were  studied  and  the  Hensel-type 
constructions  were  tested  for  applications  other  than 
factorization.  Not  only  were  new  algorithms  discovered,  in 
particular,  major  improvements  of  the  Rothschi  I d-Uang 
version  of  Hensel  construction,  but  also  methods  of 
analyzing  the  algorithms  were  innovated.  Previously,  the 
analysis  of  many  algorithms  that  has  betn  geared  excessively 
toward  either  idealized  situations  or  worst-case  situations. 
Ue  will  present  a more  intuitive  approach  of  analyzing  some 
of  our  main  algorithms  for  cases  which  are  too  complex  for 
step-by-step  analysis.  Besides,  when  we  are  dealing  with 
multivariate  polynomials,  the  basic  operation  for  each  step 
may  be  very  different  so  as  to  make  the  detailed  analysis 
misleading  and  polynomials  are  often  sparse  (most  of  the 
possible  terms  are  zero)  rather  than  dense  so  as  to  make 


worst-case  analysis  inoperative.  Ue  will  show  that  our 
intuitive  "engineering"  approach  actually  produces  timing 
formulae  which  wHI  predict  the  computing  costs  to  within 
15%  in  most  ca'ses. 

With  this  brief  description  of  the  sequence  of  events 
leading  to  the  current  research,  we  will  begin  to  present  a 
more  detailed  overview  of  the  thesis  in  the  following 
section.  An  earlier  version  of  some  results  in  this  thesis, 
in  particular,  those  of  Chapter  III,  af pears  in  Hoses  and 
Yun  [M&Y73J . 


1-2  An  Overview  and  Some  Essential  Concepts 

Ever  since  the  publication  of  Euclid’s  algorithm  in 
Book  7 of  his  E I ements  (300  B.  C.)  far  computing  the 
positive  greatest  common  divisor  of  two  given  positive 
integers,  its  various  generalizations  for  computing 
polynomial  GCD’s  have  been  useful  computational  tools  for 
centuries  (as  many  number  theorists  and  algebraists  can  well 
substantiate).  However  it  was  the  onset  of  research  in 
symbolic  and  algebraic  manipulation  that  touched  off 
numerous  significant  results  on  the  theoretical  and 
mathematical  basis,  algorithmic  descriptions,  and  computer 
implementations  of  these  methods.  In  particular.  Brown’s 
IBR071I,  Collins’  (C0L67I , and  Brown  and  Traub’s  IBST71J 
works  provide  a complete  background  as  well  as  excellent 
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technical  material  on  this  subject. 

For  the  computatir;i  of  univariate  or  multivariate 
polynomial  GCO’s  ove:-  the  domain  of  the  integers  or  the 
rationals,  the  most  popular  computational  methods  are  the 
Reduced  or  Subre3ultant  PRS  algorithms  and  the  Modular 
algorithm.  He  Mill  briefly  discuss  these  methods  in  this 
section  uhich  mil  provide  a basis  for  comparing  analytical 
and  computational  results  with  those  of  the  new  Hensel-type 
GCO  algorithms  uhich  ue  ui II  present  later. 

In  this  section,  ue  Mill  give  an  overvieu  of  the  mate- 
rial contained  in  this  thesis.  Our  principal  goal  is  to 
present  several  neu  methods  of  performing  some  essential 
polynomial  operations  based  on  the  Hensel-type  construc- 
tions. One  major  application  ~f  the  Hensel  construction  is 
the  polynomial  GCO  computation  uhich  ue  ui  1 1 describe  in 
detail  in  Chapter  III.  Other  applications  ui 1 1 be  presented 
in  later  chapters,  but  the  analysis  and  empirical 
experiments  uill  concentrate  on  the  Hensel  constructions  and 
GCO  computations.  The  concept  of  these  polynomial  opera- 
tions can  be  easily  extended  to  more  general  algebraic  do- 
mains (as  Musser  did  in  IMUS71I).  But,  in  keeping  uith  the 
principles  of  symbolic  manipulation  uhere  the  stress  is  on 
exact  arithmetic,  ue  uill  b"  dealing  mainly  uith  polynomials 
over  the  integers,  Z;  the  integers  modulo  q,  Z/q,j  or  the 
rationals,  Z/Z;  in  these  domains  many  computational  advarn- 
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tages  can  be  more  easily  demonstrated. 

Since  polynomials  are  our  main  concerns,  we  first  need 
some  basic  algebraic  concepts  about  polynomials.  Only  the 
most  essential  algebraic  notions  mil  be  defined  here, 
others  Mill  be  referred  to  the  Appendix  or  texts  on  modern 
algebra  such  as  Her stein  [HER64]  or  Birkoff  and  MacLane 
IBtfl&l . 

The  domain  of  integers  uill  be  denoted  by  Z;  the  set  of 
equi valence  classes  in  Z uith  respect  to  a congruence 
relation  moaulo  some  integer  q by  CZ/q) ; arJ  the  field  of 
rational  numbers  by  (Z/Z).  The  ring  of  polynomials  with 
indeterminants  x,y,...,z  over  a coef  r ieiei/f  domain  J ui 1 1 bo 
denoted  by  J [x,y, . . . , z] . Note  that  ue  can  also  urite  this 
polynomial  ring  as  J’ [xl  uhere  now  J’  denotes  J[y,...,zl. 
This  conceptually  forces  all  polynomials  to  be  expressed 
uith  x as  the  main  variable  uith  coefficients  in  J* . 

J(x,y z]  is  an  integral  domain  if  J is  (HERG4,  p.  1231. 

In  most  cases  of  the  ‘ollouing  chapters,  us  uill  consider 
univariate  or  mu  I ti variate  polynomial  domains  I tx]  or 

I tx.y zl  uhere  I is  often  Z or  Z/q  and  q is  a prime 

m 

pouer.  Divisions  of  polynomials  F ■ f x + ...  + f x+f 

m 1 d 

n 

and  G - g x + ...  + g over  a field  to  obtain  the  quotient 
n 0 


J 


po 1 ynom i a 1 Q • q x + . . . 

+ q 

is  done  as  fol lows: 

m-n 

0 

For  k - m - n to  k - 0,  do  q 

■ f 

g and 

k 

n+k 

n 

f ■ f - q g , j - n+k-1,. 

1 1 1 ki 

Let  J be  a unique 

j j k j-k 

factorization  domain  (u.f.d.),  then  Jfx]  is  also  a u.f.d. 
whose  units  are  units  of  J.  The  process  of  polynomial 
division  with  remainder  cannot  always  be  made  in  Jfx],  since 
the  coefficient  domain  is  not  necessarily  a field  uhere 
exact  division  can  always  be  carried  out.  However,  the  pro- 
cess of  pseudo-division  can  always  be  carried  out  [KNUG91 . 
Given  F,  G in  J[x]  unique  pseudo-quotient  Q ■ pquofF  G)  and 
pseydo-remajnder  R - premtF,  G)  can  be  found  in  Jfx]  such 
d+1 

that  g F - Q G + R and  deg (R)  < deg(G)  where  g ■»  lc(G), 

the  leading  coefficient  of  G,  and  d - deg(F)  - deg(G). 

For  non-zero  F and  G in  Jfx],  F and  G are  said  to  be 

sjjmjjar,  denoted  by  F^.,  G,  if  there  exist  f,  y in  J 

such  that  f F ■ g G.  Let  F ■ F and  F • G and  assume 

1 2 

deg(F)  > deg(G),  then  a sequence  of  remainders  F ,F  ,...,F 

1 2 k 

can  be  generated  by  pseudo-division  such  that 

F r~i  prem(F  , F ) for  i ■ 3,...,  k,  and 
i i-2  i-1 


prem(F  , F ) ■ 3.  This  is  called  the  do  I unom  i a I remainder 


IE 


; 


sequence  (PRS)  [BR071J. 

Let  G ,G  , ...,G  be  elements  of  an  integral  domain. 

12  n 

Then  D is  the  greajest  common  divisor  (GCO)  of  G ,...,G  iff 

1 n 


(1)  D divides  G ,G 
1 2 


and  G . 
n 


(2)  Every  common  divisor  of  G ,G and  G also  divides  D. 

1 2 n 

(3)  D is  unit  normal.  In  a u.f.d.,  there  always  exists  a 
unique  GCD  for  a given  set  of  elements.  If  GCO  of  two 
elements  F.  G in  the  u.f.d.  equals  1.  gcdIF.G)  - 1,  then  we 
say  that  F and  G are  relativelu  prime.  A polynomial  P in 
JM  is  prjmLtive  if  the  GCD  of  all  its  non-zero 
coefficients  is  1.  Each  polynomial  P in  J[x]  has  a unique 
representation  of  the  form  P - contIP)  pp{p),  ghere  cont(P), 
content  of  P,  is  the  unit  normal  GCO  of  the  non-zero 
coefficients  of  P and  pp(P)  - P/cont(P)  Is  the  remaining 
primitive  part  of  P. 

The  process  of  computing  the  PRS  for  two  polynomials  F 
and  G in  JUI  is  a generalization  of  the  classical  Euclid’s 
algorithm  which  constructs  a Integer  remainder  sequence 
(IRS)  [KNUS9] . It  is  clear  from  the  process  of  computing 
the  PRS  (or  !RS)  that 

gcd(F  - F,  F - G)~  gcd(F  , F ) gcd(F  , F LF  . 

12  23  k-1  k k 

Thus  the  method  of  PRS  yields  the  desired  GCO  up  .o  si m i la- 
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rity.  As  pointed  out  by  Brown  [BR071J,  if  F’  and  F*  are 

1 2 


non-zero  polynomials  in  Jtx]  uith  deg(F')  > deg(F')  and  U* 

1 2 


is  their  GCD,  then  the  PRS  algorithm  can  be  used  to  determine 

this  GCD  D’  uniquely  as  follows! 

Let  c ■ cont(F’),  c - cont(F'),  c ■ gcdfc  ,c  ), 

1 12  2 12 

F - pp(F'),  F - pp(F’),  and  0 - gcdlF  ,F  ).  If  nou 
112  2 12 

F .F  F is  a resulting  PRS,  then  it  is  clear  that 

1 2 k 


D • pp(F  ) and  D’  ■ c D.  This  is  the  so  called  Euclidean 
k 

PFS  algor i thm  [C0LS7]  as  it  is  the  obvious  generalization  of 

Euclid’s  algorithm  to  polynomials  over  a u. f.d.  which  is  not 

f 

* 

necessari ly  a field. 

I 

I 

Polynomial  GCD  computations  basically  rely  on  division 

> 

or  pseudo-division  over  a u.f.d.  Thus  the  straight-forward 

I 

generalization  of  the  Euclid's  algorithm  for  integers  for 
computing  the  GCD  of  polynomials  F and  G over  a field  is 
simply: 

F ■ F,  F ■ G,  F • -Q  F + F , deg  IF  ) < deg(F  ), 

1 2 i i i-1  i-2  i i-1 

'■3 k-  For  polynomials  over  a u.f.d.,  then,  it  i9 

n&cessary  vo  use  the  PRS  generated  by  pseudo-division.  So 

we  use  the  following  rule  instead! 

5 F - -Q  F +a  F , deg(F  ) < degIF  ),  i-3 k. 

i i i-1  i i-2  i i-1 

‘ 


i 
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The  a and  0 are  particular  chosen  elements  of  the  coeffi- 


i i 


cient  u.f.d.  and  the  epeci f ication  of  thei  determines  the 
particular  PRS  algorithm  Intended  in  the  discussion. 

The  a ’g  are  simply  the  necessary  multiplier  of  pseudo- 


division. Ue  briefly  list  several  such  PRS  GCD  algorithms 

according  to  the  choices  of  0 *8.  (For  wore  detailed  dis- 

I 

cuss  ions  of  these  algorithms,  see  CBR0711 . ) 

(a)  The  Euclidian  PRS  Algorithm:  0-1,  1-3 k. 


(b)  The  Primi tive  PRS  Algor i thm: 

0 - cont (prem(F  ,F  )),  so  that  F - pp(prem(F  ,F  )), 


i-1  i-7 


i-1  1-2 


i-3, . . . ,k. 

(c)  Collins’  Reduced  PRS  Algor i thm: 

Q — 1 , — (x  , . , . , k . 

3 i i-1 

(d)  The  Subresultant  PRS  Algorithm: 

0+1  , 

1 :-2 

0 ■ (-D  , 0 ■ -f  \f  , U4,...,k  where 

3 I i-2  i 

d - deg(F  ),  6 - d - d , f - lc(F  ),  I -1 ..... k and 

i iii  i+1  i I 


i 1-  5 
i-3  i-3 

Kp  - _1,  ^ . (_f  ) ^ , i-4,...,k. 

3 i i-2  i-1 
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Among  the  above  algorithms,  the  Euclidean  PRS  Algorithm 
suffers  from  coefficient  growth  due  to  the  use  of  the  pseudo- 
division multiplier.  This  expression  explosion  can  some- 
times, especially  for  multivariate  polynomials  where  the 
multiplier  is  polynomial,  be  so  big  as  to  make  the  GCD 
computation  prohibitively  expensive.  The  Primitive  PRS 
Algorithms  requires  a content  computation  at  each  step 
of  the  algorithm  which  may  take  a significant  part  of  the 
total  computing  time,  especially  in  mul tivari ate  cases  again. 
Both  the  Reduced  and  the  Subresultant  PRS  Algorithms 
are  designed  to  avoid  this  content  operation.  Ab  pointed 
out  by  Brown  IBR0711  even  though  there  is  a trade-off  be- 
tween the  cost  of  computing  the  content  at  each  step  and  the 
advantage  of  having  possibly  smaller  coefficients  to  work 
with,  in  most  cases  the  compensation  is  very  little. 

The  Subresultant  PRS  Algorithm  is  a further  improvement  of 
the  Reduced  PRS  Algorithm  in  that  it  even  avoids  gross  inef- 
ficiencies when  the  PRS  is  abnormal  (when  in  a PRS  F 

1 

F , there  is  some  i > 1,  deg(F  ) - deg(F  ) > 1). 
k i i+1 

As  an  illustration  of  the  severity  of  the  growth  of 

the  coefficients  in  the  PRS,  let  us  consider  univariate 

polynomials  where  N bounds  the  absolute  magnitude  of  the 

coefficients  in  i-  and  F . Let  N**a  bound  the  coefficients 
1 2 i 


V ‘ 


-— 


1 


of  F . Then,  for  the  Euclidean  PRS  Algor i the  with  a normal 
j 


PRS,  ue  have  a • 2 a +a  aea  recurrence  relation  for 

j H i-2 


the  sequence  la  I.  Thus  a /a  — > 2.414  as  j becomes 
j i+1  j 


large.  Therefore,  if  deg(F  ) - n + 1 and  deg  IF  ) - n,  then 

1 2 


the  coefficient  growth  in  the  PRS  is  like  N**(2.414**n) 


uhich  is  an  exponential  growth  in  the  number  of  digits  of 


the  coefficients.  For  Collins'  Reduced  PRS  GCD  Algorithm 


with  normal  PRS,  Knuth  [KNUG91  shows  that  the  coefficient 


growth  is  proportional  to  N uhich  is  only  a linear 


growth  in  the  number  of  digits.  Nevertheless,  the  growth  is 


still  very  large  Isee  [KNU691  and  IBR071J  for  some  exam- 


ples), h specially  uhen  the  GCD  itself  is  relatively  small. 


say  1. 


Ue  now  turn  our  attention  to  an  algorithm  uhich  clearly 


avoids  this  basic  difficulty  - the  Nodular  GCD  Algorithm. 


The  Nodular  GCD  Algorithm,  as  well  as  many  other  modu- 


lar algorithms,  basically  relies  on  tuo  common  forms  of  alge- 


braic homomorphi sms: 


(1)  The  modu I ar  homomorphism,  for  integers  modulo  a prime  p. 


(a)  Computing  a homomorphic  image  - getting  the  residue 


of  an  integer  modulo  p. 


(b)  Inverting  the  mapping  - applying  the  Chinese  Re- 


■ , m , ■ i ifnii  . ryvti  tumitoi 


1. . . ' i 


mainder  Theorem  (Garner’s  rule)  to  the  residues 


u.r.t.  (with  respect  to)  different  primes. 


(2)  The  evaJjjaJj_OQ  homomorphi sm,  for  polynomials  in  x 


modulo  x-b. 


(a)  Computing  a homomorphic  image  - evaluating  the 


polynomial  at  a point  b. 


(b)  Inverting  the  mapping  - interpolating  a polynomial 


from  its  values  at  different  points. 


Here,  ue  will  say  that  the  prime,  p,  or  the  evaluation 


value,  b,  is  Jucky  for  a given  polynomial  if  the  homomorphic 


images  of  two  distinct  divisors  of  the  given  polynomial  do 


not  become  identical  in  the  image  domain.  (See  also  (BR071)) 


The  modular  GCD  Algorithm  basically  applies  the 


fundamental  concepts  of  these  two  homoworph i sms  on  the 


integral  coefficients  and  recurses  on  each  of  the  variables 


in  the  given  polynomials.  Uithout  going  into  the  details  of 


the  flodular  Algorithm  (for  that  see  IBR071J),  ue  list  the 


steps  of  a general  modular  algorithm  uhich  is  used  for  each 


variable  and  the  integral  coefficients: 


(1)  Estimate  the  number  of  homomorphic  images  required. 


(2)  Compute  a neu  homomorphic  image. 


(3)  Solve  the  problem  in  the  image  domain. 


(4)  Apply  the  inverse-mapp .ng  algorithm  to  include  this  neu 


information. 


LX 


(5)  Test  if  sufficient  number  of  image  solutions  have  been 


computed.  If  not  go  back  to  (2).  If  so,  verify  the 
current  result  and  exit  with  it  if  correct. 

Brown  (3R0711  performed  a "worst  case"  computing  time 
analysis  which  indicated  that  the  Modular  GCD  Algorithm 
uould  outperform  the  Subresultant  Algorithm  for  sufficiently 
large  problems.  (Here,  "large"  often  means  dense  or  nearly 
dense  polynomials  of  high  degree  in  many  variables.) 

Collins  [C0L711  presented  a similar  timing  analysis  and  gave 
some  empirical  computing  results  uhich  demonstrated  the 
superiority  of  the  Modular  polynomial  resultant  algorithm  to 
the  Reduced  PRS  resultant  algorithm. 

Practical  experiences  with  the  Modular  GCD  and 
resultant  algorithms  have  shown  that  in  some  cases  these 
algorithms  take  a disastrously  long  time,  often  much  longer 
than  the  comparable  Reduced  PRS  algorithms.  The  reason  for 
this  is  that  the  Modular  algorithms  are  geared  to  the 
"uorst"  cases  and  are  remarkably  insensitive  to  the 
sparseness  of  the  polynomial  inputs  and  outputs.  To 
illustrate  this  insensitivity,  we  give  a simple  case  which 
should  be  familiar  to  anyone  uith  some  background  in 
numerical  analysis.  Any  interpolation  technique  used  to 
compute  the  polynomial  x**20  + 1 requires  21  values  of  this 
polynomial  at  21  distinct  points,  even  though  there  are  only 
two  terms  in  the  polynomial.  Thus  a modular  algorithm  which 
inherently  assumes  denseness  of  the  polynomials  might  be 


iHTi*  iif'rwia 
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inferior  to  a non-modular  one,  if  the  solution  by  the 
e/aluation-interpo!ation  process  is  in  fact  x**20  + 1.  A 
completely  dense  polynomial  in  v variables  of  degree  d in 
each  has  (d  + l)$*v  terms.  This  number  is  an  exponential 
function  of  the  number  of  variables. 

If  the  GCO  of  tuo  polynomials  is  a multivariate  polyno- 
mial in  v variables  of  degree  d each,  then  the  number  of 
evaluations  and  interpolations  performed  by  the  Modular  GCO 
Algorithm  is  at  least  (d  + l)**v,  regardless  of  the  number 
of  terms  in  the  original  polynomials  and  in  the  GCQ.  Thus 

10  10  16 

if  the  GCD  of  tuo  polynomials  is  x + x + ...  + x 

1 2 10 


(e.g.  P - this  polynomial  and  Q - x PI,  the  Modular 
Algorithm  uould  take  days  or  ueeks  to  compute  it  on  existing 
machines.  On  the  other  hand,  the  Reduced  PRS  GCD  algorithm 
might  take  less  than  a second  of  computing  time. 

The  Modular  algorithm  performs  at  its  best  uhen  the 
original  polynomials  are  univariate  or  uhen  the  resulting 
GCO  is  1,  since  this  cass  involves  feu  evaluations  and 
univariate  modular  GCD  calculations  and  no  interpolation. 

The  Reduced  or  Subresultant  Algorithms  are  at  their  best 
uhen  the  input  polynomials  are  very  small  (e.g.  polynomials 
cf  degree  2 or  less  in  each  variable)  since  they  require 
little  overhead,  or  uhen  the  GCO  is  almost  as  large  as  the 
original  polynomials  (as  is  the  case  for  P and  Q defined 
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above).  This  is  the  case  when  the  number  of  polynomial 
remaindering  operations  required  to  obtain  the  GCQ  is  small, 
say  1.  The  Modular  algorithm  degrades  as  an  exponential 
function  of  the  number  of  variables  and  the  logarithm  of 
their  degrees  in  the  GCD.  The  Reduced  or  subresultant 
algorithms  degrade  quite  rapidly,  though  the  analysis  here 
is  not  nearly  as  complete  or  elegant.  The  degradation  is 
roughly  a exponentially  growing  function  of  the  length  of 
the  polynomial  remainder  sequence  ( [BR07U  , [CCR.G7] ) 
required  to  obtain  the  GCO.  For  these  two  algorithms,  small 
ansuers  (e.g.  1)  require  the  most  work. 

According  to  Brown's  analysis,  the  computing  time  for 
the  Reduced  PRS  GCD  Algorithm  is  proportional  to 
2 

2 4v  2v  v 

L (d  + 1)  2 3 where  L is  the  maximum  integer  length 

of  the  coefficients  and  d is  the  maximum  degree  in  all  v 
variables.  The  computing  time  for  the  Modular  GCO  Algorithm 
2 v v+1 

is  proportional  to  L (d  + 1)  + L (d  + 1)  . Comparing 

these  computing  times,  we  can  see  the  important  feature  of 
the  Modular  algorithm.  That  is,  as  the  number  of  terms  in 
the  tuo  input  polynomials  grous  toward  the  exponential  num- 
ber of  terms  in  a dense  polynomial,  the  Modular  algorithm 
can  obtain  the  ansuer  much  faster  than  the  Reduced  algorithm 
and  even  faster  than  a standard  multiplication  or  division 


of  the  two  polynomials  which  would  require  on  the  order  of 


(d  + 1)  steps.  In  practice,  however,  one  is  hardly  likely 
to  ever  confront  polynomials  so  large  and  dense  that  their 
multiplication  will  be  slower  than  their  GCD  calculations  by 
the  Nodular  algorithm.  To  offset  this  potential  advantage 
of  the  Nodular  algorithms  is  the  fact,  already  noted,  that 
they  are  insensitive  to  the  possibility  of  sparseness  in  the 
original  polynomials  and  the  results.  Polynomials  in  many 
variables  of  high  degree  are  invariably  sparse  in  practice, 
and  in  fact,  exceedingly  sparse.  Furthermore,  if  one  is 
going  to  deal  with  truly  gigantic  polynomials  then  there 
exist  "fast"  polynomial  mul tipi ication  algor i thms  [B0N73] 
uhich  would  lower  the  cost  of  polynomial  multiplication  (and 
division)  to  a level  which  is  close  to  that  of  the  Nodular 
algorithm.  This  would  improve  the  computation  cost  for  any 
algorithm  requiring  multivariate  multiplication  and  division 
in  the  dense  cases. 

Since  not  cns  of  these  algorithms  performs  well  on  the 
whole  spectrum  of  input  polynomials,  there  is  much  room  for 
improvement.  The  EZGCD  (Extended  Zassenhaus  GCD)  Algorithm 
presented  later  appears  to  outperform  the  Nodular  GCD 
Algorithm  in  most  practical  situations.  It  will  not 
outperform  the  Reduced  or  Subresultant  Algorithms  in  those 
situations  when  these  algorithms  are  at  their  best.  It 


1 


does,  houever,  give  a much  better  accounting  of  itself  in 
such  cases  than  the  Modular  algorithm  would. 

As  will  be  seen,  situations  which  may  present  some 
difficulty  to  the  E2GCD  algorithm  are  those  where  the  GCD  is 
not  relatively  prime  to  either  of  the  co-divisors  (or 
cofactors,  i.e.  the  quotients  of  original  polynomials  by  the 
GCO)  of  the  input  polynomials.  Then  the  EZGCO  algorithm 
will  resort  to  3 variant  of  a square-free  decomposition  of 
the  input  polynomial.  Additional  difficulties  arise  when  no 
variable  has  a constant  leading  coefficient  in  either  of  the 
input  polynomials.  And  the  most  serious  difficulty  will  be 
cases  where  no  variable  can  have  zero  as  an  evaluation 
point. 

The  essential  difference  between  the  EZGCD  algorithm 
and  the  Modular  GCO  algorithm  is  in  the  treatment  of  the 
multivariate  case.  Uhen  the  Modular  algorithm  is  at  its 
best  (univariate,  small  GCO,  or  dense  inputs  and  outputs), 
the  performance  will  usually  be  comparable. 

As  we  mentioned  before,  the  new  methods  for  computing 
polynomial  GCO  and  performing  several  other  polynomial 
operations  are  based  upon  the  Hensel-type  constructions.  It 
turns  out  that  these  Hensel-type  methods  are  closely 
analogous  to  the  evaluation-interpolation  methods  of 
numerical  analysis  and  the  above  described  modular  methods. 
As  numerical  analysis  draws  from  the  mathematical  field  of 


. 


real  and  complex  analysis  for  its  computational  methods  and 
theoretical  supports,  it  appears  that  symbolic  manipulations 
neu  insights  from  modern  algebra,  in  particular,  the  studies 
of  geometric  behavior  of  polynomials  - algebraic  geometry. 

The  classical  Hansel's  Lemma  has  Its  fundamental 
importance  in  the  studies  of  p-adic  analysis  and  valuation 
theory.  The  use  of  this  constructive  method  in  the  problem 
of  polynomial  factorization  uas  first  suggested  by  H. 
Zassenhaus  [ZAS691 . Husser  [HUS71]  presented  this  method 
and  the  Zassenhaus'  quadratic  extension  technique  as 
algorithms  for  extending  factorizations  of  polynomials  in 
general  algebraic  domains.  Some  computational 
inefficiencies  result  from  considerations  in  such  a general 
context  especially  for  multivariate  polynomials  over  the 
integers.  A generalization  of  the  Hensel  algorithm  based  on 
the  constructive  proof  of  the  Hensel 's  reducibility 
criterion  [VQU49]  uas  devised  by  Uang  and  Rothschild  (USR73J 
in  ths  context  of  factoring  multivariate  polynomials.  A 
(v+l)-var iable  polynomial  can  be  considered  as  a generalized 
Taylor  series,  as  ue  uill  see  shortly.  The  generalized 
Hensel  construction  is  devised  to  recover  the  codivisors  of 
a given  polynomial  in  the  generalized  Taylor  series  form 
uith  one  main  variable  and  a v-dimensional  coefficient 
space.  This  is  a distinct  point  of  vieu  from  the  original 
Uang  and  Rothschild  version  and  our  discussions  of  the 
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generalized  Hensel  construction  ui  I I be  presented  from  thie 
neu  and  different  standpoint  uhich  ui I I provn  to  be  very 
bene  f i c i a I . 

The  method  essentially  amounts  to  rediscovering  the 
generalized  Taylor  series  forms  of  the  divisors  term  by  term 
from  their  linearly  independent  "values"  in  the  univariate 
polynomial  space  and  the  original  polynomial.  Such  a 
process  is  quite  a familiar  one  to  many  algebraic  geometers 
as  ue  discovered  in  the  course  of  our  uork.  The 
"evaluation"  stage  is  known  as  the  "specialization"  process 
and  the  reconstruct itn  method  ie  often  called  the 
"deformation"  process  (one  probable  reason  for  calling  it 
such  is  because  the  polynomial  and  its  factors  are  now 
transformed  to  Taylor  series  forms  corresponding  to  the 
specific  chosen  values,  hence  deformed  to  a space  with  a neu 
origin).  Interestingly  the  entire  process  of  specialization 
and  deformation  is  known  as  the  "Newton’s  method".  As  ue 
uill  see  later,  the  parallelism  between  the  evaluation- 
interpolation  method  and  this  specialization-deformation 
Newton' e method  ie  quite  striking.  However,  ue  should  note 
that  the  role  of  Hensel 's  Hemma  in  algebraic  geometry  was 
pointed  out  to  us  very  late  in  our  research. 

Let  us  first  get  a good  conceptual  understanding  of  the 
generalized  Taylor  series  representation  of  multivariate 
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polynomial  with  Integral 

in  Z(x,y y ],  and  for  a 

1 v 

b ) , we  denote 

1 v 

y -b  ) or  P(x,b  ,...,b  ) by  P (x) . Ue  will 


polynomials.  For  a multivariate 

coefficients,  P(x,y  ) 

1 v 

given  set  of  integers  b - (b 

P (x, y «b 
1 

also  write  P in  a general  Taylor 

P(x,y ) - P(x,b b ) 

1 v 1 v 


v 


i-1  j-1  k-1 


+ • • • 

This  series  is  finite  and  unique 
since  Pisa  polynomial.  Ue  let 


v b 
series  form: 


(ail  terms  of  degree 
1 in  some  y) 


(y  - b ) (all  terms  of 
j j total  degree 

2 in  the  y’s) 


- b ) (y  - b ) (y  - b ) 

i j j k k 


for  a given  set  of  b’s. 


v 


v 


i 

■ 

j 

! 


(8) 

P (x.y 


y ) - P(x,b 


b ) - P (x)  and 
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(m) 

P (x,y 

1 


V M 

y ) ■ \ ^ p <x) 

v i .....  i 

i i ««*f  i *1  1 l j *1  j j 

1 m 

n 


(y  - b ) 
i i 


Then  Pix 


posi t i ve 


.y y ) 

1 v 
integer  n. 


y ) , for  some 
v 


If  we  define  "Im"  to  be  a typical 
(m) 


i ndex 


3et  <•  ' • in  P where  for  j - 1,2 m,  i is  in 

1 m j 

*1.2 vl  and  "Mm (y y )"  to  be  the  monomial  of  the 

1 v 

ID 


(y  - b )f9  indexed  by  Im,  flmiy  ,...,y  ) -n — T(y  - b ), 

' ' 1 v I I i i 

j J j-1  j j 


<m) 


then  P 


■ / P (x)  Hm(y ) and 

1 ' Im  1 v 

Im 


P(*.y I ■/  \ ^ P tx)  Tim (y  ).  The  finite 

1 v Im  J'  1 I in 
m-0  Im 


1 


degree  bound  n is  determined  by  n - max  td  where  td  is  the 

i i i 


term  degree  of  the  iih  term  in  the  complete  expansion  of  P, 
that  is  the  sum  of  the  degrees  of  the  non-main  variables 
y ,...,y  in  the  ith  term. 


i.  1 1 n n « hip  wji 
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For  S - (y  -b -b  I , we  can  extend  the  notion  of 

11  v v 

modular  congruence  to  multivariate  polynomials  with  respect 

to  a set  of  integers  b.  Thus  P (mod  S)  will  simply  be  P (x), 

b 


i.e.  dropping  all  terms  in  the  Taylor  series  form  of  P which 


2 (0)  (1) 

contain  any  element  of  S|  P (mod  S ) - P + P j and,  in 


m-1 


general,  P (mod  S 


■-Z 


(i) 


P (x,y y ).  Therefore,  due 

1 v 


i *0 


n+1 


to  the  finitenes.T  of  the  series  form  of  P,  P (mod  S ) is 
s imply  P written  in  the  general  Taylor  series  form  with  res- 
pect to  b,  so  that  it  is  equal  to  P independent  of  any  chosen 


b.  For  J - Z (y  ,..,,y  ] or  (Z/q)  [y y ],  it  is  a 

1 v 1 v 


direct  extension  of  previous  notation  to  denote  the  space 


m m 

of  all  polynomials  modulo  S by  J/S  . As  an  example  of  such 


a generalized  Taylor  series  representation,  we  take  a tri- 
variate polynomial 


2 2 2 2 2 
P(x,y  ,y)«y  x +(-y  y - y y + y y + y +2y)x 
12  1 12  12  12  1 1 


3 3 2 

+ y y - y - y y - y y + 2y,and 
12  2 12  12  2 


b ■ lb  , b I - (1,  0} . Then  P (x)  - x + 3 x, 
1 2 b 


and  in  the  Taylor  series  form,  which  is  typically  obtained 
by  a Hensel-type  construction, 

(1)  (2)  (3)  (4) 

P(x,y  , y ) - P Cx)  + P + P + P +P  where 
1 2 b 

(1)  2 

P (x,y  ,y  I • 12  x - 3 x)  (y  - 1)  + 0 ty  - 0), 

12  1 2 

(2)  2 2 
P (x,y  ,y  ) - (x  + 1)  (y  - 1)  + t-x  - 3)  (y  - 1)  y 

12  1 12 

2 

+ (-*)  y , 

2 

(3)  2 2 

P (x,y  ,y  ) ■ (-x  — 1)  (y  - 1)  y + t— x)  ty  “ 1)  y , 

12  12  12 

(4)  3 

P (x.y  ,y  ) - 1 (y  - 1)  y . 

12  1 2 

Clearly  the  term  degree  bound,  n,  ie  4 for  this  P. 

S » ly  - 1,  y 1 and,  for  instance, 

1 2 

1 

2 (01  (1)  2 2 1 

P (mod  S ) - P + P - (x  + 3x)  + (2  x - 3x)  (y  - 1). 

1 

The  above  concepts  and  notations  will  be  of  crucial 
importance  to  our  discussion  of  the  various  Hensel 

■ 

constructions  in  Chapter  U.  There,  we  will  present,  first, 

Hensel’s  Lemma  itself,  then  the  quadratic  extension  method 
due  to  Zassenhaus  for  the  univariate  polynomials.  Section 
1 1-4  contains  the  detailed  descriptions  of  the  multivariate 


A 
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Hensel  construction  - the  Generalized  Hensel  Algorithm. 
Finally,  in  Section  11-5,  we  discuss  the  uses  of  these 
Hensel  constructions  over  the  domain  of  integers  which  leads 
naturally  to  a discussion  of  the  difficulty  and  solution  of 
the  non-trivial  leading  coefficient  problem  in  this  case. 
Some  simple  but  comprehensive  examples  are  carried  out  to 
help  give  a clearer  understanding  of  these  constructive 
methods  which  form  the  foundation  of  all  the  new  algorithms 
presented  later. 

Chapter  III  contains  the  new  algorithms  for  computing 
univariate  or  multivariate  polynomial  GCD’s  using  the  Hensel 
constructions.  Ue  will  emphasize  the  multivariate 
algorithm,  EZGCD,  since  it  provides  an  efficient  method  for 
the  large  class  of  intermediate-sized  problems.  The  general 
set-up  of  the  new  method  of  calculating  GCD’s  and  the 
fundamental  theorems  proving  the  validity  of  this  method 
will  be  presented  for  the  multivariate  case  only,  since  the 
univariate  case  obeys  the  same  rules.  In  Section  IIJ-3,  we 
present  the  UNIGCD  Algorithm  which  uses  the  Zassenhaus’ 
Quadratic  Extension  Algorithm  to  compute  univariate  GCD’s. 
Then  the  main  application  of  the  Hensel  construction  in  this 
thesis  - EZGCD  Algorithm,  is  finally  discussed  in  full 
detail  in  Section  111-4.  The  solution  to  the  non-trivial 
leading  coefficient  problem  for  the  computation  of  GCD’s 
also  appears  in  detail  in  this  section. 


Z+ 

The  timing  analysis  of  many  algorithms  in  Chapters  II 
and  III  are  collected  in  Chapter  IV.  Ue  analyze  the 
computing  costs  of  the  Hensel-type  algorithms,  argue  the 
case  for  our  more  intuitive  timing  formula  for  the 
multivariate  Hensel  construction.  Then  the  computing  cost 
of  the  EZGCO  Algorithm  is  derived  and  it  is  3houn  that  the 
dominating  cost  here  is  in  the  Hensel  construction.  These 
derived  computing  time  formulas  are  verified  by  timing  many 
cases  of  GCD  computations.  These  empirical  results  strongly 
suggest  the  validity  of  the  derived  formulas  since  the 
actual  ccmputing  times  ordinarily  correspond  to  the  formula 
predictions  to  uithin  15%  and  frequently  much  closer. 

The  next  three  chapters  contain  three  other  useful 
applications  of  the  Hensel  constructions  in  algebraic 
computations.  Again  ue  concentrate  on  the  multivariate 
cases  , since  the  univariate  cases  are  much  less  complex  so 
that  they  can  essentially  be  considered  special  cases  of  the 
multivariate  methods. 

Chapter  V outlines  the  polynomial  factorization 
algorithms  using  the  Zassenhaus’  Quadratic  Extension 
algorithm  in  the  univariate  case  and  a neu  and  more 
efficient  multiple  factor  generalized  Hensel  construction 
for  the  multivariate  case.  Chapter  VI  shows  how  ue  can  take 
advantage  of  the  univariate  images  of  the  codivisors  of  one 
of  several  given  polynomials  and  apply  the  Hensel 


construction  only  once.  Thu9,  polynomial  contents  and 
primitive  parts  or  the  GC0  of  more  than  two  polynomials  can 
be  computed  in  a 9emi-paral lei  way.  In  Chapter  VII,  we 
point  out  some  of  the  neff iciencies  in  the  known  methods  of 
computing  square-free  decompositions  of  multivariate 
polynomials.  Then  we  will  generalize  a method  already 
introduced  as  the  special  case  algorithm  for  EZGCD,  and  get 
still  another  application  of  the  Hensel  construction  in  a 
new  9quare-free  decomposition  algorithm  - EZSQFR. 

Finally,  in  Chapter  VIII  we  summarize  this  research, 
point  out  several  more  potential  applications  of  the  Heneel- 
type  constructions,  and  indicate  some  open  research  problems 
of  interest  in  th i s direction. 


It 
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CHAPTER  II.  THE  HENSEL  ALGORITHMS 
II  - 1 Introduction 

In  this  chapter,  ua  will  discuss  the  Hensel-type  algo- 
rithms for  polynomials  over  Z,  or  Z/q  in  particular  those 
where  computational  advantages  can  be  most  easily  demon 
strated.  Because  of  the  importance  of  these  Hensel  construc- 
tions for  the  discussion  of  all  the  other  material  in  the 
following  chapters,  some  intuitive  feeling  as  well  as  a 
conceptual  grasp  of  these  methods  is  imperative.  To  aid  in 
achieving  this  understanding,  we  will  intersperse  numerous 
examples  among  the  discussions.  Also,  for  theoretical  and 
pedagogical  purpose*?,  we  compare  the  algorithms  with  a 
purely  integral  version.  Analyses  of  these  Hensel-type  algo- 
rithms will  be  done  in  Chapter  IV,  where  we  will  also 
present  some  empirical  results  to  justify  our  arguments  and 
the  computing  time  formulas. 

II  - 2 Tuo  Basic  Supporting  Algorithms 

Lemma  1 1-2. 1:  Lst  Fix)  and  GIx)  be  in  (Z/p)  lx]  where  p 

is  a prime  in  Z.  If  gcdIF.G)  ; D(x)  (mod  p) , then  there 
exist  unique  (up  to  units)  A(x)  and  B(x)  in  (Z/p)  tx]  such 
that  A(x)  F(x)  + B(x)  G(x)  ■ 0(x)  (mod  p)  where 
deg (A)  < deg(G)  - deg(O)  and  deg(B)  < deg(F)  - deg(O). 

Proof:  (Z/p)  tx]  is  an  Euclidean  ring,  so  that  the 

Preceding  page  blank 
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Extended  Euclidean  algorithm  applies.  A version  of  this 
algorithm  for  the  domain  of  integers  is  presented  by  Knuth 
(KNU69J  and  can  easily  be  generalized  for  (Z/p)  [x] . 

Assume  there  exist  A’  and  B'  in  (Z/p) lx]  such  that 
A’F  + B’G  « 0 (mod  p)  and 

deg(A’)  < deg(G)  - deg(D),  deg(BM  < deg(F)  - deg(D).  Then 
(A  - AM  (F/0)  - (B*  - B)  (G/D) ..  Since  (F/D)  and  (G/D)  are 
relatively  prime,  (F/0)  divides  B - B*.  But 
deg(B  - BM  < deg(F)  - deg(O),  thus  B - B’  ; 0 or  B ; B’. 
Similarly  A - A'.  The  A and  B so  found  by  the  Extended 
Euclidean  algorithm  must  then  be  unique  in  (Z/p)(x).  // 

Example  1 1-2. la:  As  a simple  example,  let  F(x)  - x, 

G (x)  - x + 3,  and  p - 7 then  clearly  (-1)  x + 1 (x  + 3)  - 3 
or  2 x + (-2)  (x  + 3)  ; 1 (mod  7)  so  that  A - 2,  B - -2, 
and  D - 1. 

Coro  1 1 ary  11-2.1:  If  p in  the  above  lemma  is  not  a 

prime,  but  all  prime  divisors  of  p are  "lucky"  (see  IBR071J 
or  Section  IV  - 2),  then  the  conclusions  of  the  above  lemma 
still  hold. 

Proof:  Since  p is  lucky,  the  Extended  Euclidean 

algorithm  still  applies  (refer  to  Theorem  1 of  [BR071J).  It 
is  also  clear  that  the  proof  for  uniqueness  of  A and  B 
holds.  // 

Since  (Z/p)  lx]  is  a Euclidean  ring,  regular  division 
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of  polynomials  with  remainder  can  be  carried  out.  Given 

polynomials  F - F and  F - G in  CZ/p)  [x] , the  PRS, 

1 2 

F ,F  ,...,F  can  be  found  together  with  the  quotients  Q 
12k  I 

such  that  F -Q  F + F . i • 3, . . . ,k.  Then 
i-2  i i-1  i 

0(x)  ■ F (x)  and  A(x),  B(x)  are  constructed  from  the  Q(x)’e 
k 

as  specified  in  the  Extended  Euclidean  algorithm.  Ue 
present  a tabulated  version  of  this  algorithm  to  ma..a  the 
algorithm  easier  to  understand. 

Algor i thm  1 1 -2.1: 

Input:  F and  G in  IZ/pHx]. 

Output:  A,  B,  and  D in  (Z/p) [xl  such  that 

A F + B G - 0 (mod  pi 


0 

A 

3 

1 

< 

B 

3 

F F 
3 A 

A ■ 1,  A ■ 8,  B ■ 0,  B ■ 1,  and  A ■ A - Q A 
1 2 1 2 i i-2  i i-1 

B - B - Q B , i ■ 3 k.  That  is,  as  ue  compute  Q 

i i-2  i i-1  i 

and  F by  the  division  algorithm,  ue  can  fill  In  the  values 
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of  A and  B by  subtracting  the  product  of  the  current  quo- 
i i 

tient  and  the  previous  A or  B from  the  one  A or  B before. 

FinallyA«A,B«8,  and  0 ■ F uhere 
k k k 

A F + B G = D (mod  p). 

3 

Example  1 1 -2. lb:  Let  Fix)  ■ x +2, 

2 

G(x)  - x + 2 x - 2,  and  p ■ 5.  By  division,  ue  get 

F ■ Q G + F • (x  - 2)  G + (x  - 2)  and 

3 3 

G-Q  F +F  ■ (x  - 1)  F +1.  Thus  the  tabulated  solu- 

4 3 4 3 

tions  are  as  follous: 


x-2 

x— 1 

1 

0 

1 

-x+l 

0 

1 

-x+2 

2 

x +2x-2 

x-2 

1 

2 

So  A * -x  + 1,  B«x  + 2x-2,  and  D ■ 1.  It  is  easy  to 
verify  this  result,  and  clearly  A F + B G - D (mod  p)  with 
deg (A)  - 1 < deg(G)  ■ 2. 

Remark:  8y  the  Corollary,  p may  be  a composite  integer 

and  the  algorithm  will  still  be  valid  so  long  as  I c (F)  and 
lc(G)  are  units  in  (Z/p). 

Lemma  11-2.2:  For  non-zero  Fix)  and  G(x)  in  (Z/pHx), 


and  any  H(x)  such  that  gcd(F,G)  - D(x)  uhich  divides  H(x) 


(mod  p),  there  exist  unique  A(x)  and  B(x)  in  (Z/p)  tx]  such 
that  A F + B G ; H (mod  p)  uhere  deg  (A)  < deg(G)  - deg(D). 

Proof:  By  Lemma  1 1-2.1,  we  can  find  A’,  B’  in  (Z/p)  tx] 
such  that  A’  F + B'  G ;0  (mod  p)  uhere 
deg(A’)  < deg(G)  - deg(O),  deg(B’)  < deg(F)  - deg(D). 

Let  C (x)  - H(x)/D(x) , then  (C  A’)  F + (C  BM  G - H (mod  p) 
or  (C  A’)  (F/0)  + (C  B’)  (G/0)  - C (mod  p).  Apply  the 
division  algorithm  to  C A'  and  G/D  so  that 
C A'  ■ Q (G/D)  + R where  deg(R)  < deg(G)  - deg(D).  Let 
A - R and  B - C B1  + Q (F/D).  Then  A (F/D)  + B (G/D)  - C or 
A F + B G - H (mod  p).  Assume  there  also  exist  A"  (x) 
and  B"(x)  in  (Z/p)  tx]  satisfying  A"  F + BM  G • H (mod  p) 
and  the  degree  constraints.  Then 

(A  - A")  (F/D)  - (B"  - B)  (G/D)  (mod  p)  ....  (Eq.  1 1-2.2). 

F/D  and  G/D  are  relatively  prime,  so  (G/D)  must  divide 
(A  - A")  (mod  p).  But  deg(A  - A")  < deg(G)  - deg(D),  there 
fore.  A - A"  - 0 (mod  p)  or  A ; A"  (mod  p).  Hence,  from 
(Eq.  1 1 -2  .2),  B i B"  (mod  p) , since  G/D  is  non-zero. 

Therefore,  the  A and  B so  found  must  be  unique  in  (Z/P)  txl . 

// 

Example  1 1 -2. 2a : Continuing  from  Example  1 1 —2. la,  assume 

2 

H(x)  • x , then  following  the  notations  in  the  proof  of 

2 

Lemma  1 1-2.2  C(x)  • H/D  » x , since  D • 1,  A*  ■ 2,  and 
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2 

8*  ■ -2.  C A1  ■ 2 x ■ Q G + R where  Q • 2 x + 1,  R ■ -3. 

Thus  A ■ -3,  0 ■ C 8'  + Q F * x,  and 

2 

A F + B G > -3  x + x (x  + 3)  « x where  deg  (A)  < deg(G). 

it  is  quite  obvious  here  that  AH  F + B"  G * H ui th 

A"  - x and  B"  - 0 also  solves  the  equation.  But  then  the 
condition  deg (A")  < deg(G)  is  not  satisfied.  In  fact, 
x - Q"  G + R"  with  Q"  - 1 and  R“  - 3.  Using  the  same 
technique,  we  can  get  R"  F + (Q"  F + B")  G • H where 
Q"  F + B"  « x and  that  is  the  same  solution  as  the  above  A 

t 

and  B.  This  is  a simple  example  showing  that  the 
equation  A F + B G - H has  many  solutions  but  division 
by  G (or  F)  can  be  used  to  obtain  a unique  solution. 

Coro  I laru  1 1 -2. 2:  If  H(x)  in  Lemma  1 1-2.2  satisfies 

! 

the  degree  constraint  deg(H)  < deg(F)  + deg(G)  - deg(D), 
then  B(x)  satisfies  a degree  constraint  similar  to  that  for 
A(x),  deg(B)  < deg(F)  - deg(D). 

Proof:  As  in  the  proof  of  Lemma,  B - C B’  + Q (F/D) 

satisfies  A F + B G - H (mod  p),  thus 
deg (B)  < deg(H  - A F)  - deg(G)  < deg(F)  - deg(D).  // 
Example  1 1 -2. 2b:  Continue  from  Example  1 1 -2. lb , we 
4 3 2 

assume  H (x)  - 2 x + x + 2x  + x + 2,  then  fol  lowing  the 
proof  of  Lemma  1 1-2.2  and  using  the  results  of  Example 
1 1 -2. lb,  C (x)  - H(x)  since  0 - 1,  A’  - -x  + 1,  and 
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2 3 

B’  - x + 2 x - 2.  C A'  « Q G + R where  Q • -2  x +1  and 

R ■ 2 x - 1.  SoA-R-2x-l,  B-CB’  + QF-2x-2, 

and  A F + B G - H where  deg (A)  < deg(G),  and  since 

deg(H)  < deg(F)  + deg(G),  also  deg(B)  < deg(H).  This  result 

will  be  used  in  Example  11-3.1  in  the  next  section. 

Remark:  In  Lemma  or  Corollary  11-2.2,  if  p is  not  a 

prime  but  lc(G)  is  a unit  in  Z/p,  then  the  conclusions  still 

hold. 

Example  1 1 -2.2c:  In  this  example,  p is  not  a prime,  but 

computations  similar  to  the  above  example  can  still  be 
carried  out.  (The  results  of  this  example  will  later  be 
used  in  Example  1 1-3.2.)  Assume  we  have 
2 2 

A’«-x-9,  B’-x  - 3 x + 3,  F - x + 10  x - 8, 

2 

G - x - 12  x + 7,  and  p - 25,  then  A’  F + B*  G ; 1 (mod  p). 
Clearly  lc(G)  - 1 is  a unit  in  (Z/25).  Let 
3 2 

H(x)  - -x  - 11  x - G x - 7 in  (Z/25)  lx]  be  given  and  we 
want  to  find  A (x) , B(x)  in  (Z/25) tx)  such  that 
A F + B G * H (mod  25).  Since  C(x)  ■ H/D  - H and 
I c (G)  - 1,  the  division  algorithm  can  be  carried  out  to 

2 

compute  C A’  - Q G + R where  Q-x  + 8x-9  and  R - 0. 

Thus  A - 0,  B-CB’  * Q F - -x  + 1,  and  we  get 
AF+BG^Hwith  them.  Therefore,  when  p is  not  a prime 
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but  I c (G)  is  a unit  in  (Z/p),  the  process  remaine  the  same  so 
long  as  the  division  algorithm  can  be  carried  out. 

The  algorithm  for  achieving  the  goals  9et  in  Lemma  II- 
2.2  i3  similar  to  Algorithm  2.7S  (Solution  of  a polynomial 
equation)  of  Musser  tnUS71) , except  ours  can  be  used  for 
general  right-hand-sides  other  than  1. 

A I qor i thm  11-2.2: 

Input:  F.  G.  A*.  8’.  D.  and  H in  (Z/p)  lx)  such  that 

A’  F + 8’  G - 0 and  D divides  H or  C - H/D  is  in  (Z/p)  tx) . 
Output:  Unique  A and  B in  (Z/p)  tx)  such  that  AF  + BG;H 

where  d'g(A)  < deg (G)  - deg(D). 

(If  deg  H)  < deg(F)  + deg (G)  - deg(D),  then  aleo 

deg(B)  < deg(F)  - deg(O).) 

(1)  Set  A <—  A*  C.  If  D f 1,  set  F <--  F/D,  G <—  G/D. 

(2)  Apply  mod  p division  algorithm  to  compute  Q and  R euch 
that  A - Q G + R,  deg (R)  < deg(G). 

(3)  Return  A < — R and  B < — C B'  + Q F (mod  p). 

11-3  Hensel’s  Lemma  - The  Univariate  Case 

Ue  will  now  present  the  Hen9el  Lemma  for  extending  a 

k 

factorization  modulo  p to  a factorization  modulo  p for  any 
k > 1.  The  presentation  is  essentially  based  on  that  of 
Van  der  Uaerden  [V0U49J.  This  Lemma  provides  the  founda- 
tion of  most  other  algorithms  ue  uill  discu99  later.  The 


proof  of  the  Lemma  is  constructive,  but  ue  ui  I I still 
formalize  it  into  an  algorithm.  The  reason  for  that  is 
because  it  is  very  important  to  get  a firm  grasp  of  the 
underlying  ideas  of  this  Lemma.  Ue  mil  not  use  the 
construction  of  the  proof  of  the  Lemma  for  computations  in 
the  univariate  case,  rather  ue  ui  1 1 use  Zassenhaus’ 
Quadratic  Extension  algorithm.  Houever,  the  construction 
for  extending  multivariate  factorizations  is  a direct 
generalization  of  this  Lemma  as  ue  shall  soon  see. 

Lemma  1 1 -3 . 1 : (Hense I ) 

Let  Fix)  be  in  Z lx)  and  p be  a prime  in  Z.  Assume 

Fix)  - G ix)  H ix)  imod  p) , uhere  G lx)  and  H (x)  are 

11  11 

relatively  prime  polynomials  in  iZ/p) [xl . Then,  for  any 

integer  k > 1,  there  exist  polynomials  G ix)  and  H (x)  in 

k k 

k 

(Z/q)  ix]  uhere  q - p such  that  Fix)  - G (x)  H (x)  (mod  q) 

k k 

and  G » G (mod  p),  H - H (mod  p). 
k 1 k 1 

Proof:  Since  G and  H are  relatively  prime  in 

1 1 

(Z/p)[x),  ue  can  find,  by  applying  Algorithm  II-2.1,  Aix), 

Bix)  in  (Z/p)  ix]  such  that  AG  + B H - 1 (mod  p)  and 

1 1 

deg  (A)  < degiH  ),  deg  (B)  < degiG  ).  From  G and  H , ue  ui  I I 

1 111 
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construct  sequences  of  polynomials 


(G  I and  (H  ) by  induction 

j j 


j 


such  that  F - G H (mod  p ) and  G ■ G , H - H (mod  p) . 

j j j 1 j 1 

j 

Assume  for  j > 1,  we  have  G (x),  H (x)  in  (Z/p  ) tx]  euch  that 


j 

F » G H (mod  p ) 


j j 

Let  C (x)  in  (Z/p) (xl  be  such  that 
j 

j j+1 

p C (x)  - F (x)  - G (x)  H (x)  (mod  p ) 

i j j 

j+1 

where  all  arithmetic  operations  are  performed  in  (Z/p  ) 

j+1 

with  G and  H considered  to  be  polynomials  in  (Z/p  ) [x] . 

j j 

j 

Note  that,  by  (») , p divides  into  F(x)  - G (x)  H (x) 

j j 

exactly.  Now  apply  Algorithm  11-2.2  on  G , H , A,  B and  C 

11  j 

in  ;z/p)  and  obtain  A (x),  B (x)  in  (Z/p)  [x]  such  that 

j j 

A (x)  G (x)  + B (x)  H (xl  j C (x)  (mod  p)  and 
j 1 J 1 ' j 

deg(A  ) < deg(H  ).  Now  let 

j 1 

j j 

G - G + p B and  H - H + p A , then 
j+1  j j j+1  j j 
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j J+l 

G H -GH+p  (AG+BH)  (mod  p ) 
j+l  j+l  j j j 1 j 1 


j j+l 

■ G H +p  C ■ F(x)  (mod  p ) 
j j j 


Because  of  this  construction, 


J 


G -G  + pB  + ...+p  B and 
j+l  1 1 j 


J 


H - H + pA  + ...+p  A , for  all  j > 1 
j+l  1 1 } 


So  clearly  G * G , H ■ H (mod  p)  for  all  j > 1 * // 

i " 1 j 1 


Remark:  If  F(x)  is  a monlc  polynomial  in  Z(xl  then 


deg(F  - G H ) < deg(G  ) + deglH  ).  So  C and,  in  turn,  C 
11  1 12  j 


for  all  j > 1 satisfy  the  condition  of  Corollary  II-2.2. 


Therefore,  we  have  deg(B  ) < deg(G  ) for  al I j > 1.  This 

j j 


uill  prove  to  be  a very  important  observation  later,  espe- 
cially for  solving  the  leading  coefficient  problem  of  the 
EZGCD  algorithm.  It  should  also  be  noted  that  the  Hensel 
construction  is  a method  for  expressing  the  factors  in  a 
p-adic  representation  or  a type  of  power  series  form.  This 


is  clearly  seen  from  the  last  expressions  for  G and  H 

j+l  j+l 


in  the  above  proof.  An  example  here  uill  make  these  points 
clearer,  and  also  demonstrate  the  constructive  nature  of  the 
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above  proof. 


Example  11-3.1:  Let  F(x)  in  Ztx]  be 


5 4 3 2 

x + 12  x - 22  x - 163  x + 309  x - 119  and  p - 5.  Then 


in  (Z/p) [k1 . F(x)  - G (x)  H (x)  where  G (x)  • x +2  and 
1 1 1 


H (x)  - x + 2 x - 2.  Let  us  carry  out  two  iterations  of 
1 

the  Hen9el  construction,  i.e.  assume  k ■ 3 and  find  G (x), 

3 


H (x)  such  that  F(x)  - G (x)  H (x)  (mod  q)  Wnere  q - p . Ue 
3 3 3 

will  cheat  a little  by  revealing  the  answers, 

3 2 

G (x)  - x - 15  x + 17  and  H (x)  • x + 12  x - 7.  The 
3 3 

reason  for  doing  this  is  to  be  able  to  see  exactly  how 
these  coefficients  are  discovered  as  the  algorithm  pro- 
ceeds. The  first  thing  to  do  is  to  compute  A(x)  and  B ( x)  in 

(Z/p)  lx]  such  that  AG  + B H ■ 1 (mod  p).  But  this  com- 

1 1 

putation  was  already  done  in  Example  11 -2. lb  and  we  have 


A ( x)  - -x  + 1 and  B(x)  ■ x + 2 x - 2.  Next  we  have 
4 3 2 

F - G H - 10  x - 20  x - 165  x + 305  x - 115  over  Z with 
1 1 


G and  H considered  polynomials  in  Z(xl.  Thus 
1 1 
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4 3 2 

C (x)  ■ (F  - G H )/p  (mod  p!  • 2 x + x + 2 x + x + 2 and 
1 1 1 

p C (x)  is  the  "residue"  of  F and  G H in  the  next  bigger 
1 1 1 


2 

polynomial  space  (Z/p  ) [x] . The  goal  now  is  to  correct  or 

add  to  G and  H what  is  missing  so  as  to  make  this  residue 
1 1 

disappear  and  to  have  G and  K such  that 

2 2 

2 

F - G H (mod  p ).  To  do  so  we  need  to  find  A (xi  and 
2 2 1 

B (x)  in  (Z/p)  lx]  such  that  A G + B U - C (mod  p) . 

1 11111 

Conveniently,  this  computation  has  also  been  done  in  Example 

1 1 -2. 2b,  with  A (x)  ■ 2 r - 1 and  B (x)  - 2 x - 2.  Thus 
1 1 

3 

G -G  +p0  -x  +10x-8  and 

2 1 1 

2 

H - H +pA  -x  + 12  x - 7.  Thus,  as  is  easily  verified, 

2 1 1 

2 

F - G H (mod  p ).  At  this  point,  note  that 
2 2 

2 2 

G is  simply  G (mod  p ) and  H is  H (mod  p ).  In  fact, 

2 3 2 3 

2 

H ■ H since  p ■ 25  is  already  bigger  than  twice  the 
2 3 


-i. 





f 

I ! 

fe 

I 


coefficients  of  H . Uhat  this  iteration  has  accoip I i shad  is 
3 

then  obvious.  For  the  second  iteration, 

2 3 2 

C (x)  « (F  - G H )/p  (mod  p)  ■ -(x  + x + x + 2). 

2 2 2 

Applying  Algorithm  1 1 -2. 2 on  F,  G,  A,  B,  and  D • 1,  we 

get  A (x)  ■ 0 and  B (x)  » -x  + 1 in  (Z/p) tx]  such  that 
2 2 

A F + B G - C (mod  p).  Then, 

2 2 2 

2 3 

G - G + p B - x - 15  x + 17.  As  we  already  pointed  out, 
3 2 2 

H * H and  surely  A • 0 so  that  H need  not  be  updated. 

3 2 2 2 

Therefore,  we  have  constructed  G and  H such  that 

3 3 

F - G H in  (Z/q)  lx] , as  we  set  out  to  do. 

3 3 

As  we  can  see  from  the  construction, 

2 2 
G - G + pB  +p  B andH  - H + pA  + p A which 
3 11  2 3 11  2 

shous  clearly  G -G  andH  «H  (mod  p).  Actually, 

3 1 3 " 1 

3 2 3 

G * x - 15  x + 17  » (1  + 05  + 05)  x + 

3 

2 2 

(0+25+  (-1)  5 ) x + (2  + (-2)  5+15)  uhich  is  putting 

the  coefficients  of  G into  p-adic  representation  with 

3 


i 


p ■ 5.  Then,  the  correspondence  of  this  representation  and 

the  above  formula  for  G as  constructed  from  G , B , and  B 

3 112 

becomes  clearer. 

Algor i thm  1 1-3.1:  (Hensel) 

Input:  F(x)  in  Ztx],  p a prime  in  Z which  does  not  divide 

I c (F) . k > 0 in  Z,  G (x)  and  H (x)  in  (Z/p)  lx]  which  are 

1 1 

relatively  prime,  and  F ■ G H (mod  p). 

1 1 . 


k 

Output:  G (x)  and  H (x)  in  (Z/q  ) Ixl  where  q - p such 

k k k k 

that  F - G H (mod  q)  and  G - G , H » H (mod  p). 
k k k 1 k 1 

(1)  Apply  Algorithm  1 1 -2.1  to  G - G and  H - H and  obtain 

1 1 

A,  B in  (Z/p)  tx]  such  that  A G + B H - 1 (mod  p). 

Set  j < — 1,  q < — p,  and  q’  < — q p. 

(2)  If  j ■ k then  output  G and  H. 

Otherwise,  set  C <--  (F  - G H)/q  and  C < — C (mod  p). 

(3)  Apply  Algorithm  II-2.2  on  G,  H,  A,  B,  and  C in  (Z/p)  lx] 
to  get  A’  and  B*  in  (Z/p) lx]  such  that 

A’  G + B’  H - C (mod  p). 

(4)  Set  q <—  q’,  q’  <--  q p,  j <--  j + 1,  and  go  to  (2). 

Ue  next  discuss  the  Zassenhaus’  quadratic  extension 

algorithm  IZASG91  IMUS7 1 1 CU&R73]  which  is  an  improved  ver- 
sion of  the  Hensel’s  method  described  above.  This  algorithm. 


because  of  its  added  efficiency  for  snail  primes  p or 
large  required  modulus  q,  is  actually  used  in  univariate 
cases  of  many  later  algorithms  for  extending  factors  from 

k 

2 

Z/p  to  Z/q,  where  q - p for  some  k > 0,  and  consequently 
to  Z (as  we  uill  see  in  Section  1 1 -5) . 

Lemma  11-3.2:  (Zassenhaus)  Let  F(x)  be  in  Ztx]  and 

p be  a prime  in  Z which  does  not  divide  the  leading  coeffi- 
cient of  F(x).  Assume  F(x)  ■ G (x)  H (x)  Uod  p)  where 

0 0 

G and  H are  relatively  prime  polynomials  in  (Z/p) lx] . 

0 0 

Then,  for  any  integer  k > 0,  there  exist  polynomials 

k 

2 

G («)  and  H (x)  in  (Z/q  ) tx] . where  q - p , such  that 
k k k k 

F(x)  ■ G (x)  H (x)  (mod  q ) and  G ■ G , H ■ H (mod  p) . 
k k k k ” 0 k ~ 0 

Proof:  I c (G  ) and  I c (H  ) must  be  units  in  (Z/p),  since 
0 0 

I c (F ) is  one.  By  Lemma  and  Corollary  11-2.1  and  using 

Algorithm  K-2.1,  we  can  find  A , B in  (Z/p)  lx]  such  that 

0 0 

A G + B H ■ 1 (mod  p)  and 
0 0 0 0 

deg  (A  ) < deg(H  ),  dog(B  ) < deg(G  ). 

0 0 0 0 

From  G , H , A , and  B , we  wl 1 1 construct  sequences  of 
0 0 0 0 


i 


S3 


polynomials  (G  I , (H  I,  (A  I,  and  IB  ) by  induction  such  that 
j j i j 

F ■ G H (mod  q ) , G ■ G , H - H (mod  p) , and 
j j j j 0 J 0 

A G + B H - 1 (mod  q ),  I c (H  ) is  a unit 
j j j j j j 

in  (Z/q  ),  and  deg  (A  ) < deg(H  ),  deg(B  ) < deg(G  ).  Assume 
j j j j j 

j 

for  j > 0 ue  have  G , H , A , and  B in  (Z/p  ) [x]  such  that 

j j j j 

F - G H (mod  q)andA  G +B  H ■ 1 (mod  q ) . 
j j j j j j j j 

Also  assume  I c (H  ) Is  a unit  in  (Z/q  ) and 

j j 

deg (A  ) < deg(H  ),  deg(B  ) < deg(G  ). 

J J j j 

Let  C (x)  in  (Z/q  ) (xl  be  ’ ' that 

j j 

q C (x)  - F(x)  - G (x)  H (x)  (mod  q ) uhere  all  arith- 
j i j j j+1 

metic  operations  are  performed  in  (Z/q  ) lx] . Nou  ue  apply 

j+1 

Algorithm  11-2.2  on  G , H , A , B , and  C in  (Z/q  ) lx]  and 

j j j j j j 

obtain  H' , G’  in  (Z/q  ) lx]  such  ttdt 

j j j 

H’  G + G’  H - C (mod  q ).  Let  G - G + q G’  and 
j j j j j j j+1  j j j 

H - H + q H’ , then 

j+1  i j j 

G H - G H + q (H*  G + G*  H ) (mod  q ) 

j+1  j+1  j j j j j j j j+1 


I.1-  m-lif 
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i G H + q C - F (k)  (mod  q ) 

j j j j j+1 

Since  I c (H  ) is  a unit  in  Z/q  and  deg(H’)  < deg(H  ), 

j j j j 

I c (H  ) - lc(H  ) is  also  a unit  in  Z/q  and 

j+1  j j+1 

deg(H  ) * deg(H  ) - deg(H  ).  Now  let  R (x)  in  (Z/q  ) [xl 

j+1  j 8 j j 

be  such  that  AG  +BH  ■ 1 + q R (Mod  q ). 

j j+1  i j+1  j j j+1 

Apply  Algorithm  II-2.2  on  G , H , A , B and  R in  (Z/q  ) tx] 

j j j j j j 

and  get  A’,  B’  such  that  A’  G + B’  H - R in  (Z/q  ) tx] 

j j j j j j j j 

and  deg (A’)  < deg(H  ).  Let  A - A - q A’  and 

j j j+1  j j j 

B - B - q B’ , then 

j+1  j j j 

A G + B H 

j+1  j+1  j+1  j+1 

-AG  + B H - q (A’  G + B’  H ). 

j j+1  j j+1  j j j j j 

-1+q  R -q  (R  ) - 1 (mod  q ) 

j j j j j+1 

and  deg (A  ) - deg (A  ) < deg(H  ) « deg(H  ),  hence, 

J+1  j j j+1 

deg(B  ) < deg(G  ),  because  I c (H  ) is  a unit.  By  this 

j+1  j+1  j+1 

construction,  G -G  +q  G’+...+q  G’  and 

j+1  0 0 0 j j 

H - H + q H’  + ...  + q H’,  so  that  clearly  G - G 
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and  H ■ H (mod  p)  for  all  j > 0.  II 
j 0 

Remarks  If  F(x)  is  a monic  polynomial  in  Z(x],  then 

deg(F  - G H ) < deg(G  ) + deg(H  ),  so  that  C , and  in  turn, 
0 0 0 0 1 

C for  all  j > 1 satisfy  the  condition  of  Corollary  II-2.2. 


Thus  we  must  have  deg (B* ) < deg (G  > in  addition  to 

j j 

deg (A’)  < deg(H  ). 

j j 

Example  1 1-3.2:  As  an  example  for  this  quadratic 
construction  ue  take  the  same  F and  p from  Example  II-3.1, 

3 2 

so  that  ue  have  G ■ x +2  and  H ■ x + 2x-2  such  that 

0 0 

F - G H (mod  q - p - 5)  as  before.  Ue  want  to  carru  the 
0 0 0 

construction  to  k ■ 2 or  q ■ 625  which  is  already  greater 


than  125  bounding  twice  (since  ue  use  positive  and  negative 

numbers  centered  at  0 as  residue  class  representatives  in 

any  Zlq)  the  magnitude  of  coefficients  in  G and  H of 

3 3 

Example  1 1-3.1.  Thus  our  goal  is  still  to  construct  G and 

3 

H but  using  the  quadratic  construction  with  A - -x  + 1 and 

3 0 

2 

B -x  + 2 x - 2 such  that  A G + B H - 1 (mod  q ). 

0 0 0 0 0 0 
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From  these  data  we  can  compute  G and  H in  (Z/q  ) lx]  as 

1 1 1 

3 

we  did  in  Example  11-3.1,  G • x + 10  x - 8 and 

1 

2 

H - x + 12  x - 7.  Next  we  find  R (x)  in  (Z/q  ) lx]  such 
1 0 0 

that  AG+BH-l+qR  (mod  q -25), 

0 10  1 0 0 1 

3 2 

R - ((15  x + 5 x - 20  x + 6)  - l)/q  (mod  q ) 

0 0 0 

3 2 

* -2  X +x  +X+1.  Applying  Algorithm  11-2.2  on  G , 

0 

H , A , B , and  R in  (Z/q  ) [x],  we  find  A*  - 2 and 
0 0 0 0 0 0 

B’  - x - 1 such  that  A’  G + B’  H ■ R (mod  q ).  Thus 
0 0 0 0 0 0 0 

2 

A - A - q A’  - -x  - 9,  B - B - q B’  - x - 3 x + 3, 

1 0 0 0 1 0 0 0 

2 

and  A G + B H ■ 1 (mod  q - p ).  Now  we  begin  the  next 
1111  1 

iteration  by  computing  C (x)  in  (Z/q  Kx]  such  that 

1 1 

q C - (F  - G H ) (mod  q - 625). 

11  11  2 

3 2 

C - (-25  x - 275  x + 475  x - 1751/25  (mod  25) 

1 

3 2 

- -x  - 11  x - 6 x - 7.  He  need  to  apply  Algorithm 
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1 1-2.2  on  G , H , A , 0 , and  C in  (Z/q  ) [x]  to  obtain  H’ , 
11111  1 1 

G’  9uch  that  H’  G + G'  H ■ C (mod  q ).  Conveniently, 

1 11111  1 


Example  1 1-2. 2c  has  done  just  that  with  H’  » 0 and 

1 


3 

G*  ■ -x  + 1.  So  G « G + q G’  • x - 15  x + 17  and 
1 2 111 


2 

H - H + q H’-H  -x  + 12  x - 7. 
2 1111 


Ue  have  now  constructed  the  desired  polynomials  using 
the  quadratic  method.  The  power  of  this  method  over  the  regu- 
lar Hensel  construction  was  not  clearly  demonstrated,  but  ue 
can  easily  9how  it  by  changing  the  problem  slightly.  If  we 
3 2 

let  F - G H - (x  - 265  x + 142)  (x  + 187  x - 107),  then 
for  the  regular  Hensel  construction  we  see  that  the  modulus 
uould  have  to  be  at  least  625  (since  it  is  the  least  power 
of  the  modulus  5 which  exceeds  2*265-530)  before  we  can  con- 


struct from  G and  H any  polynomial  with  coefficients  of 
1 1 


magnitude  up  to  265.  That  means  we  would  carry  out  at  least 

2 

three  9teps  of  the  Hensel  construction,  from  modulo  p to  p 
3 4 

to  p to  p - 625,  But  for  the  quadratic  method,  only  two 
iterations  uould  be  sufficient,  since  the  modulus  uould 
already  be  625.  Actually,  ue  intentionally  chose  G and  H 


mtmtftitoirtTliimntif1* — ■ - 


rn 
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2 

so  that  G » G ■ x +10x-8  and 
1 

2 

H - H ■ x + 12  x - 7 In  (Z/25)[xJ.  The  ortly  difference 
1 

in  the  computation  for  this  case  is  in  the  second  iteration, 

where  we  need  to  compute  a neu  C and  then,  fro*  G , H , A , 

1 111 

B , and  C , obtain  H’  ■ 7 x - A and  G'  ■ -9  x + E in 
111  1 

(Z/25)  lx]  such  that  H*  G + G*  H ;C  (mod  25).  Thus  we 

11  11  1 

3 

get  final ly  G » G + 25  G*  ■ x - 2G5  x + 1A2  and 
2 1 

2 

H - H + 25  H’  - x + 187  x - 107. 

2 1 1 

Of  course,  this  examp  Iff  only  shows  a saving  ot  one 

iteration  for  using  the  quadratic  method  instead  of  the 

regular  Hensel  construction.  As  the  coefficients  get  large 

enough,  the  difference  in  the  number  of  iterations  uill  be 

bigger.  For  k iterations  of  the  Hensel  construction,  the 

quadratic  method  needs  only  to  go  through  (1  + the  greatest 

integer  of  log  (k))  steps.  However,  let  us  point  out  an 
2 

important  fact  - the  quadratic  growth  in  the  modulus  its 
obtained  at  the  expense  of  having  to  solve 
A G + B H - R (mod  q ) at  each  step,  a cost  which 

j j j j j j 


ia  not  negligible.  Complicated  cost  trade-off  is  involved 
nere.  A more  detailed  study  of  this  trade-off  is  currently 
underuay.  Preliminary  results  for  the  univariate  cases  show 
that  the  cost  of  solving  these  equations  above  is 
sufficiently  dominant  so  as  to  make  the  quadratic  method 
less  efficient  than  the  regular  Hensel  construction  (with 
some  modifications).  There  are  also  reasons  to  etrongly 
suggest  that  the  multivariate  constructions  will  turn  out  to 
give  the  same  indications.  But  that  will  again  be  left  to  a 
later  study. 

Algor i thm  1 1-3.2: 

(Zassenhaus*  Quadratic  Extension  Algorithm) 

Input:  F(x)  in  Ztx],  p a prime  in  Z which  does  not  divide 

lc(F),  k > 0 in  Z,  G (x)  and  H (x)  in  (Z/p)  lx]  which  are 
0 0 

relatively  prime,  and  F ■ G H (mod  p). 

0 0 

k 

2 

Output:  G (x)  and  H (x)  in  (Z/q  ) tx)  where  q ■ p such 

k k k k 

that  F - G H (mod  q ) and  G ; G , H - H (mod  p). 
k k k k 0 k 0 

(1)  Apply  Algorithm  II-2.1  to  G ■ G and  K ■ H and  obtain 

0 0 

A,  B in  (Z/p)  (x)  such  that  A G + 8 H - 1 (mod  p). 

2 

Set  j <—  0,  q <--  p,  and  q’  <—  q . 


- . u. 
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(2)  If  j - k then  output  G and  H. 

Otherwise,  set  C < — (F  - G H)/q  and  C < — C (mod  q). 

(3)  Apply  Algorithm  II-2.2  on  G,  H,  A,  B,  ard  C in  (Z/q)  [xl 
to  get  A*  and  B’  in  (Z/q)  (xl  such  that 

A’  'J  + B’  H - C (mod  q) . 

Fet  G’  <—  G + q B’  and  H’  <—  H + q A*. 

(4)  Set  R <—  (A’  G*  + B’  H’  - l)/q  (mod  q’). 

(5)  Apply  Algorithm  II-2.2  on  G,  H,  A,  B,  and  R in  (Z/q)  tx] 
to  get  A*  and  B’  such  that  A’  G + B’  H ■ R in  (Z/q)  tx] . 

Set  A <—  A - q A*.  B <—  B - q B\  G <—  G\  and  H <~  H\ 

2 

(6)  Set  q <—  q’ , q’  <—  q , j <—  j + 1,  and  go  to  (2). 

II  - 4 Generalized  Hensel  Algorithm 

I4e  nou  present  the  Generalized  Hensel  Algorithm  (GHA) 
for  extending  univariate  factorizations  modulo  q to 
multivariate  factorizations  expressed  in  the  general  Taylor 
series  form  modulo  q uhere  q is  the  rth  pouer  of  a prime  p. 
Musser  [f1US71]  presented  a version  of  an  "abstract" 
algorithm  uhich  is  also  intended  to  extend  factorizations  of 
multivariate  polynomials.  Houever,  his  basic  underlying 
idea  of  recursively  applying  the  Zassenhaus  Quadratic 
Extension  Algorithm  has  the  drauback  of  requiring  a general 
algorithm  for  solving  multivariate  polynomial  equations  in 
the  form  of  A F + B G ■ H ui th  F,  G and  H given.  He  will 


| 

i 

i 


j 
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see  later  that  there  is  a Hensel-type  algorithm  which  will 


compute  A and  6.  However,  repeated  use  of  a multivariate 


polynomial  division  with  remainders  algorithm  in  Husser’s 


Algorithm  Q Keeps  his  version  of  the  "Generalized  Hensel 


Algorithm"  from  being  computationally  efficient.  Therefore, 


the  version  of  the  Generalized  Hensel  Algorithm  discussed 


here  is  based  rather  on  a presentation  by  Hang  and 


Rothschild  [USR731  in  the  context  of  factorization  even 


though  the  presentation  and  the  algorithm  itself  may  appear 


to  have  been  modified  to  a point  of  non-recognition. 


Theorem  1 1 -4. Is  (General i zed  Hensel) 


Let  F(x,y  ,y  ,...,y  ) be  a multivariate  polynomial  in 
1 2 v 


(Z [y y 1 ) (x) . Let  b - (b  , . . . ,b  ) be  a given  set  of 

1 v 1 v 


integral  values  and  S - (y  -b  ,...,y  -b  ) such  that  the 

11  v v 


leading  coefficient  of  F evaluated  at  y - b y - b 

11  v v 


satisfies  ( I c (F) ) - 0 (mod  q)  where  q is  a given  rih 

b 


power  of  a lucky  prime  p for  F . Assume  there  exist  re  la- 
ta 


tively  pr  ime  G (x)  and  H (xl  in  (Z/q)  (x)  » (J/S)  (x) , where 

1 1 


J - (Z/q)  (y  ,...,y  ),  such  that  F »G  H (mod  q).  Then 
1 v 11 

k 

for  any  k > 1,  there  exist  G and  H in  (J/S  ) (x)  such  that 

k k 


r 


F - G H (mod  q,  S ) and  G ; G , H ;H  (mod  q,  S). 
k k k 1 k 1 

Proof ; Since  (lc(F))  V0  (mod  q),  <t  must  be  a unit 

b r 

in  Z/q,  hence  lc(G  ) and  I c (H  ) must  be  unite  also.  By  Lemma 
1 1 

and  Corollary  1 1-2.1,  ue  can  use  Algorithm  1 1 -2. 1 to  find 

A (x)  and  B (x)  in  (J/S)  Cm)  such  that  A G + B H ■ 1 and 
1 1 1111 

deg  (A  ) < deg(H  ),  deg(B  ) < deg(G  ).  From  G , H , A , and 
1 11  1 111 

B in  (J/S)  [x]  ue  uill  construct  sequences  of  polynomials 
1 


IG  I and  IH  ) by  induction  such  that  F ; G H in  (J/S  ) Cm] 
mm  mm 

and  G ■ G , H - H (mod  q,  S).  Assume  for  m > 1,  ue 
m 1 m 1 


have  G (x,y  ,...,y  ) and  H (x,y  y ) in  (J/S  ) Cm)  such 

ml  v ml  v 

that  F - G H . Let 


R (x,y  , ...,y  ) ■ F - G H (mod  q,  S ) 
ml  v mm 


■ \ . . . N C (x)  (y  - b ) 

i , i , . • . , i i i 

i -1  i -1  1 2 m M j i 


Zc  (x)  n (y ) 

Iff)  ml  v 
Int 


For  each  C in  (Z/q) tx]  a typical  coefficient  polynomial  of 
Im 


R , we  can  apply  Algorithm  1 1-2.2  on  G , H , A , B , and  C 
m 1111  Im 


in  (Z/q) (x)  and  obtain  A , B such  that 

Im  Im 


A G + B H ■ C and  deg  (A  ) < deg(H  ) in  (Z/q)  lx] . 
Im  1 Im  1 Im  Im  1 


Non  let  G 


- \ ^ B (x)  N (y ) 

Im  ml  v 


\ ' A (x)  H (y  ,.,,,y  ) 
/ «J  Im  ml  v 


(m)  (m) 

G -G+G  , and  H -H+H  . 
m+1  m m+1  m 


(m)  (m) 

Then  G H ;GH+H  G+G  H 
m+1  m+1  m m 1 1 


i G H +R  ■ F in  J/S  , By  this  construction, 
mm  m 


(1)  (m) 

G ■ G +G  +...+G  and 
m+1  1 


(1)  (m) 

H «H+H  +...+H  ,so  that  G ■ ,G  and 
m+1  1 m ~ 1 


H - H n (J/S)  for  all  m > 1.  // 

m 1 


TOR'  ~ •****'— z.wir'K  «■ 


Im 


(m) 

Therefore,  in  addition  to  deg(H  ) < deg(H  ) because  of 

1 


(«) 

deg(A  ) < deg(H  ),  we  also  have  deg(G  ) < deg(G  ) for  all 
Im  1 1 

m > 1 because  of  deg(B  ) < deg(G  ). 

in  1 

Example  1 1-4. la:  Let  U9  first  do  a simple,  monic 

example  using  the  generalized  Hensel  construction  for  a 

multivariate  polynomial.  Let 

2 2 3 3 2 

F - x + (-z  -y  z +y  +z  +2)  x + y z -z  + y z -y  z + 2 z. 

Since  F is  monic,  there  is  no  problem  in  finding  a valid 

evaluation,  so  let  b — (b  , b ) - 10,  0)  and 

1 2 

S - ly-8,  z-01  - ly,  z) . This  Mill  make  the  problem  of 
picking  out  coefficients  of  monomials  very  simple.  In  fact, 
this  device  of  using  zeros  as  evaluation  points  Mi  1 1 be  used 
whenever  possible.  Assume  we  somehow  determined  a choice  of 
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q ■ P ■ 7,  then  ue  have  G (x)  - k and  H (x)  - x + 2 in 

1 1 


(Z/q)  lx]  such  that  F - G H (mod  q).  In  this  case  ue 

1 1 


will  let  k - 3 and  carry  out  two  iterations  of  the  con- 
struction. Again,  ue  reveal  in  advance  that  F is  actually 
the  product  of  G and  H ui th  G ■ x - y z + z and 


H = x + y - z +2;  and  ue  uill  see  hou  these  polynomials 


get  recovered  from  G and  H as  ue  proceed.  First,  ue  need 

1 1 


to  f ind  A (x)  and  B (x)  in  (J/S) [xl  - (Z/q)  [x]  such  that 

1 1 


A G + B H - 1 (mod  q).  By  applying  Algorithm  II-2.1, 
1111 


ue  get  A (x)  » 3 and  B (x)  - -3.  So  ue  compute  the  current 

1 1 


"residue"  R ■ F - G H (mod  q,  5 ).  I4e  find  that 

1 1 1 


2 3 3 2 

F - G H « (-z  -y  z +y  + z)  x + y z -z  -y  z -y  z + 2 z 
11 


Modulo  S means  dropping  all  terms  uith  term  degree  in  the 
non-main  variables  y and  z higher  than  1.  Thus 


fl  * (y  + z)  x + 2 z in  (J/S  )[x].  Representing  this  in 

1 


terms  of  monomials  of  the  non-main  variables,  ue  get 


R - (x)  y + (x  + 2)  z,  so  ue  have  C (x)  ■ x and 

1 1 


(1) 

G * 8 (x)  y + B (x)  z ■ z and 
1 2 


(1)  (1) 

H - A (x)  y + A (x)  z - y,  so  that  G ■ G + G <■  x + z 
12  2 1 


(1) 

and  H - H +H  - x + 2 + y.  Comparing  this  with  G and 
2 1 


H,  we  note  that  G and  H already  have  more  correct  terms 
2 2 


than  G and  H . Although  G and  H still  differ  from  G and 
11  2 2 


H by  some  terms  of  higher  degree,  they  now  contain  all  terms 
of  G and  H uhich  are  linear  in  the  non-main  variable  , i.e. 

2 

G and  H approximates  G and  H correctly  modulo  S » 

2 2 


Continuing  now  to  the  second  iteration,  ue  find 
3 

R - F - G H (mod  q,  S ) 

2 2 2 

2 3 2 3 3 

■(-yz-zlx+yz  -y  z-2yz-z  (mod  S ) 


2 2 

- (-y  z - z ) x - 2 y z • I-K  - 2)  y z + (-x)  z . So,  we 

have  C (x)  - -tx  + 2)  and  C (x)  ■ -x  such  that 
12  22 


R * C (x)  y z + C (x)  z . Again  it  is  trivial,  even 
2 12  22 

uithout  using  Algorithm  1 1-2.2,  to  see  that  A ■ 0, 

12 

B - -1  such  that  A G + B H ■ C and  A * -1, 
12  12  1 12  1 12  22 

B - 0 such  that  A G + B H - C . Thus 
22  22  1 22  1 22 

(2)  2 

G -B  yz  + B z • -y  z and 

12  22 

(2)  2 2 

H -A  yz  + A z--zso  that 

12  22 


G • G +G  ix  + z-  yz  and 

3 2 

(2)  2 

H - H +H  -x  + y + 2-  z.  Note  that  already 

3 2 

G ■ G and  H - H,  since  all  terms  quadratic  in  the  non-main 
3 3 

variables  have  been  included,  and  G,  H contain  only  up  to 


quadratic  terms  any  way  so  that  equality  modulo  S is  the 
same  as  being  equal  over  Z for  all  terms.  Indeed,  we  find 


F - G H - 0 over  the  integers. 
3 3 


G8 


Before  we  specify  the  details  of  the  Generalized  Hensel 
algorithm,  ueuill  discuss  several  computational  improve- 
ments which  will  be  important  in  the  actual  computational 
processes  later.  The  first  such  improvement  is  due  to  a 
suggestion  by  Moses. 

Lemma  11-4.2:  Let  G(x)  and  H (x)  be  relatively  prime 
polynomials  in  (Z/q)  txl  and  let  A (x)  and  B (x)  be  found 


such  that  A G + B H • x in  (Z  tx]  for  all  i where 
i I 


0 < i < deg(G)  + deg(H).  Then  for  any  C(x) 


such  lhat  deg(C)  - k < deg(G)  + deg(H),  polynomials  A (x) 

and  B(x)  in  (Z/q)  lx]  can  be  constructed  from  the  A and  B ’a 

i i 

such  that  A G + B H ■ C wnod  q). 


Proof:  Let  A c A and  B ^ c 


B ...(*), 


AG+BH-^S  c (A  G + B H)  ■ C (mod  q) 

^ i i i 


. // 


Remark:  If  A (x)  and  B (x)  are  found  usinq  Algorithm 


1 1 -2. 2,  then  by  Corollary  11-2.2  deg(A  ) < den(K!  and 


^ J 'll 


deg(B  ) < deg(G)  for  all  0 < i < deg(G)  + deg(H),  and  for 


i - deg (G)  + deg(H),  deg  (A  ) < deg(H),  and  deg  IB  ) < degIG). 

I I 


Thus,  since  c ’o  are  independent  of  x,  the  computation  of  A 


and  B involves  simply  the  additions  and  multiplications 
in  (*)  and  the  degree  constraints  for  A and  B are  automati- 
cally satisfied:  deg (A)  < degIH),  deg (B)  < degIG)  for  all 

C such  that  deg  1C)  < degIH)  + deg(G)  and  otherwise 
deg(B)  <deg(G). 

Also,  the  c ’ s,  instead  of  being  elements  of  (Z/q) , 


can  be  polynomials  which  are  independent  of  x (or  c in 


(Z/q)  [y y )),  and  the  computation  of  A and  B is  still 

1 v 

simply  a "scalar  linear  combination"  of  A ’s  and  B ’s. 


Coming  back  to  the  generalized  Hensel's  construction, 

we  can  urite  any  R (x,y y ) as  polynomials  in 

ml  v 


(J/S  ) (x) , i.e.  in  the  form 


of£  C (y 


, ...,y  ) x where 


C ’ s are  in  (Z/q) ty  ,...,y  ] and  k is  the  maximum  degree  of 
i 1 v 

x in  R (clearly  k < deg(F)  ■ deg(G  ) + degIH  )).  Thus 
m 11 


/ C (y  I • a • | 
' i 1 


,y  ) B (x)  and 


■Lv 


< , . . , y ) A (x)  so  that 


(m)  (m)  ^ 

H G + G H - > C (y ) (A  G + B H ) 

1 1 L-J  j l v i 1 i 1 


m R in  (J/S  ) (xl.  So  ue  see  that,  in  order  to 


(tn)  (m) 

compute  G and  H it  is  no  longer  necessary  to  solve  all 

the  pciynomial  equatione  A G + B H ■ C , but  simply 

I m i Im  1 In 


use  the  solutions  of  A G +B  H - x to  form  scalar 

i 1 i 1 

linear  combinations  as  needed. 

Next,  we  observe  that  it  is  not  necessary  to  compute 


R by  the  formula  F - G H (mod  q,  S ),  but  we  can 
m+1  «+l  m+1 

get  R by  the  follouing  scheme  using  quantities  already 
m+1 

computed  previously: 

As  in  the  proof  of  the  Theorem  (1 1-4.1),  ue  urite 


71 


(m)  (1)  (m) 

G * G + G ■ G + G + ...  + G 

m+1  m 1 

(m)  (1)  (m) 

H -H+H  -H+H  + ...  + H 

m+1  m 1 

Define  0 - F - G H where  the  arithmetical  operations  are 
rn  in  ib 

done  over  Z with  G H considered  as  polynomials  in 

m m 


m+1 

Z(x,y y J.  Then  R is  simply  D (mod  q,  S ) and 

1 v m m 

m 

D » 0 (mod  S ),  i.e.  D contains  non-zero  terms  only  in 
m m 


k 

(J/S  ) (x)  for  k > ra.  Thus  D - F - G H and 

1 1 1 

2 

R - D (mod  q,  S ).  In  general, 

1 1 


(m)  (m)  (m)  (m) 


D " F 

- G H 

- 0 

- G 

; H 

- 

G H - G H 

m+1 

m+1  m+1 

m 

m 

m 

m+2 

and  R 

0 (mod 

q,  S 

) 

for 

all  i 

m > 1.  (Since 

m+1 

m+1 

m+1 

D - 0 

(mod  S ) , 

R 

is 

obtained 

from  D by  simply 

m+1 

m+1 

m+1 

setting  all  flk (y  , . . 

.,y  ) 

to 

0 in  the 

general ized  Taylor 

1 

V 

series  form  of  D , 

for 

all 

k > 

m+1. 

) 

m+1 


Finally,  we  note  that  the  generalized  Taylor  series  form 

ie  not  always  a very  easy  representation  to  obtain.  To 

rewrite  a multivariate  polynomial  P(x,y ) in  that  form 

1 v 

it  will  be  necessary  to  divide  P successively  by  each  and 

every  (y  - b ) until  all  combinations  of  n such  (y  - b )’s 
' ' i i 

are  used  where  n is  the  maximum  term  degree  of  the  non-main 

variables.  This  i s a combinator ial ly  complex  process.  It 

is,  therefore  better  to  use  the  following  more  efficient 

technique:  (However,  it  is  important  to  note  the  this  is 

not  necessarily  the  optimal  technique,  as  will  be  seen  later 

when  we  discuss  the  so  called  "Non-zero  Substitution" 

problem  of  this  method.) 

Let  y’  - y - b for  i - l,...,v.  Then  y - y’  + b eo 
' * ' i i i 

that  if  we  completely  expand  P(x,y’+b  ,...,y'+b  ) and  express 

11  v v 

that  in  the  Taylor  series  form  with  b’  (the  integral  evalu- 

I 

ation  values  for  y’)  - 0 for  all  i,  we  get 
i 

n 

P(x,y  ,,..,y  ) • \ C (x)  (1m (y y' ) where 
1 v Im  1 y 

m-0 


m 


- b 


Mm(y  , . . . ,y  ) . 


(1m(y’ 
1 


(Hence  the  name  "monomial"  used  for  Hm  in  Sec.  I - 2.) 

Note  that  with  this  representation  of  P,  S - (y’,...,y'}  and 

1 v 

m+1 

P (mod  S ) is  a simple  operation  of  dropping  (or  setting 

to  zero)  all  monomials  whose  term  degree  exceed  m with  no 

re-representation  or  divisions  by  (y  - b ) necessary.  Also 

I i 

it  is  an  equally  simple  matter  to  transform  P(x,y’ y*) 

1 v 

back  to  P(x,y  ,...,y  ) — by  substituting  y - b for  y’. 

1 v i i i 

Example  1 1 -4. lb:  Uith  all  the  computational 
mechanisms  described  above,  we  can  now  do  a more  complex 
example  before  we  give  the  detailed  specifications  of  the 
Generalized  Hensel  Algorithm.  This  example  is  the  same  as 
Example  I(-4.1a  except  the  leading  terms  non  have  non- 
trivial coefficients.  Let  G-yx  + z-  yz  and 
2 

H ■ y x + y - z +2,  so  that  the  given  polynomial 

2 2 

F - G H ■ y x 

2 2 2 3 3 2 

+ (-y  z -y  z +y  z +y  + 2 y)  x + y z -z  -y  z -y  z +2  z. 

In  order  for  the  evaluation  to  be  valid  for  this  polynomial, 

2 

ue  can  only  use  non-zero  values  for  y since  I c (F)  - y . Let 
b - (b  . b ) ■ (1,  0)  and  S - (y-1,  z) . Let  q ■ p ■ 7,  then 


mmm 


f 


| 

f{ 

| 

I 

I 

L 
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we  have  G (x)  ■ x and  H (x)  ■ x + 3 in  (Z/q) tx]  such  that 

1 1 

F - G H (mod  q).  Ue  let  k ■ 4 and  carry  out  three  itera- 
1 1 

tione  of  the  construction  to  compute  G and  H such  that 

3 3 


3 

F - G H (mod  q,  S ).  First,  ue  apply  Algorithm  II-2.1  on 
3 3 

G , H in  (Z/q) (x]  and  find  A (x)  and  B (x)  such  that 
1 1 00  08 

A G + B H - 1 (mod  q).  But  Example  II -2. la  has 
00  1 00  1 

already  done  so  with  A • 2 and  B - -2.  Then,  according 

10  10 

to  the  first  computational  suggestion,  ue  compute  A (x) , 

01 

B (x)  such  that  A G + B H • x (mod  q)  and  A (x), 

01  01  1 01  1 02 

2 

B (x)  such  that  A G + B H - X (mod  q).  Since 

02  02  1 02  1 ' 

G - x , it  is  clear  that  A - 1 and  B - 3,  and  conve- 
1 01  01 

niently  Example  1 1 -2 . 2a  readily  gives  us  A * -3  and 

02 

B "X.  Next,  we  perform  the  suggested  substitution  in 
02 

order  to  facilitate  the  necessary  operations  of  putting 


m 

polynomials  modulo  S later.  Let  y’  ■ y - 1 so  that 
y - y’  +1.  Substituting  this  into  F,  we  get 


} 

1 


/ 


2 2 

F’  - (y*  + 2 y'  + 1)  x 

2 2 2 2 

+ (-y*  z - z - y’  z - y’  z + y’  - 3 y’  + 3)  x 

3 2 

+ y’  z - y*  z - 3 y’  z.  With  this  substitution, 

S - iy-1,  zi  becomes  S’  - ly’,  zl  which  is  a convenient  form 
for  picking  out  coefficients  of  monomials  or  performing 


modulo  S’  operations. 

Having  carried  out  these  preliminary  computations,  ue 

can  proceed  to  go  through  the  steps  of  the  first  iteration 

of  the  General i zed  Hense I construction.  According  to 

another  computational  suggestion  above,  we  compute 

D ■ F’  - G H over  Z and  get 

1 1 1 

2 2 

D - (y*  + 2 y'l  x 

1 

2 2 2 2 3 

+ (-y’  z - z -y’  z-y’  z + y’  -3y’)x  + y’z 


- y’  z - 3 y’  z. 

2 2 2 
R - 0 (mod  q,  S’  I ■ 2 y’  x - 3 y*  x - (2  x - 3 x)  y’ 
1 1 

- C (x)  y’ . From  the  preparations  already  done,  ue 
1 

have  A (x)  ■ 2 A (x)  + (-3)  A (x)  ■ -2  and 
1 02  01 

B (x)  - 2 B (x)  + (-3)  B (x)  - 2 x such  that 
1 02  01 


A H + B G ■ C (nod  q)<  So  G is  simply 
11  11  1 


B y’  * 2 x y*  and  H - A y'  ■ -2  y'.  Then 
1 1 

(1)  (1) 

G ■ G +G  - x + 2 x y*  and  H -H  + H ■ x + 3 - 2 y' 
2 1 2 1 


such  that  G H ■ F*  (nod  q.  S’  ). 

2 2 

Next,  ue  compute 

(1)  (1)  1)  (1) 
D-D-GH  -G  H - b H 

2 11  1 

2 2 2 2 2 2 
- y’  x + (-y*  z - 2 y'  -y’  z-y’  z-Jt)x 

2 3 

- y’  z + y’  z - 3 y’  z.  Then 


R ■ 0 (mod  q.  S’  ) 

2 2 

2 2 2 2 

- y’  X + (-2  y’  - y’  z - z ) x - 3 y’  z 

2 2 2 

- (x  - 2x)  y’  + (-x  - 3)  y’  z + (-x)  z 

2 2 
■ C (x)  y'  + C (x)  y’  z + C lx)  z . By  taking 
11  12  22 

scalar  linear  combinations  according  to  the  coefficient 

polynomials  C , C , and  C , ue  easily  get 
11  12  13 


x,  juch 


A (x)  - 1 A - 2 A - 2,  B (x)  • 1 B - 2 B - 
11  02  01  11  02  01 


that 

A G 

+ B H 

- C ; 

A (x)  - 0, 

B (x)  - -1, 

such 

11  1 

11  1 

11 

12 

12 

that 

A G 

+ B H 

• C 5 

and  A (x) 

» -1,  B (x) 

- 0 

12  1 

12  1 

12 

22 

22 

such 

that  A 

G + B 

H i 

C (mod  q) 

. Then 

22  1 22  1 22 


(2)  2 22 

G *B  y’+B  y’  z + 8 z-xy’-y’  z and 

11  12  22 

(2)  2 2 2 2 

H -A  y*  + A y*  z + A z * 2 y’  - z , so  that 

11  12  22 

(2)  2 
G -G  + G « x + 2 x y'  + x y’  - y*  z, 

3 2 

(2)  2 2 
H - H +H  »x  + 3-  2y'+2y’  - z,  and 

3 2 

3 

G H - F’  (mod  q,  S’  ) . 

3 3" 

Finally,  ue  compute 

(2)  (2)  (2)  (2) 

D-D-GH  -G  H-G  H 

3 2 2 2 

43222  2 3 2 

- (-2  y’  -2  y'  +y’  z -y’  z +y’  z ) x +2  y’  z -3  y’  z. 

4 3 2-2  2 

Then  R ■ D (mod  q.  S’  ) ■ (-2  y’  -y*  z +y'  z )x  -3  y'  z 
3 3 

3 2 2 

- (-2  x)  y’  + (-X  - 3)  y’  z + (x)  y’  z 


WK**»  »»W«KW(? 


78 


3 2 2 

■ C (x)  y * + C (x)  y'  z + C (x)  y*  z . 

Ill  112  122 

Again,  we  can  ccmpute,  by  scalar  linear  combinations, 

A - -2,  0 - 0,  A » 0,  B - -1,  A » 1,  and 

111  111  112  112  122 


B - 0.  Thus 
122 


(3)  3 

2 

2 2 

G - B y’ 

+ B 

y’  z + B 

y*  z - -y’  z, 

111 

112 

122 

(3)  3 

2 

2 3 2 

H - A y’ 

+ A 

y*  z + A 

y’  Z • -2  y’  + y’  z , 

111 

112 

122 

(3) 

2 2 

so  that  G « G 

+ G 

■ x + 2 x z’ 

+ x y’  - y*  z - y*  z 

4 3 

2 

- (y*  + 2 y*  + 1)  x - (y’  + 1)  y’  z, 

(3)  2 2 3 2 

H -H  + H - x ♦ 3 - 2 y’  +2  y’  -z  - 2 y’  + y*  z , 

4 3 

4 

and  G H - F’  (mod  q,  S'  ). 

4 4 

At  this  point,  one  might  object  to  the  fact  that  neither 

G nor  H has  any  resemblance  to  G’  or  H’  (G  or  H with 
4 4 

y’  + 1 substituted  for  y)  at  all.  This  is  a complication 

caused  by  the  non-trivial  leading  coefficient  of  the  given 

polynomial.  All  we  can  point  out  here  is  that  if  G were 

4 


transformed  back  into  a polynomial  in  y by  substituting 
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y - 1 for  y’  (us  get  y x - (y  - 1)  z),  then  it  becomes  a 
simple  multiple  (by  y)  of  U.  Ue  will  give  a more  complete 
discussion  of  this  in  the  next  section,  where  this  example 
will  be  continued  and  G,  H will  actually  t recovered. 
Here,  it  is  only  important  to  observe  that  we  can  actually 


find  the  sequences  of  polynomials  IG  , G , G ) and 

2 3 4 


IH  , H , H I such  that  G H ■ F’  (mod  q,  S’  ), 
2 3 4 mm 


m - 2,  3,  and  4. 

Alqori thm  1 1 -4.1  (General i zed  Hense I ) 


Input:  F (x,y  ,...,y  ) in  (Z(y J)(x]{  b - (b  ,...,b  ) 

1 v 1 v 1 v 


a given  set  of  integer  values  such  that  ( I c (F) ) 4 0 (mod  q) 

b r 


where  q is  a given  lucky  prime  power  for  F ; 

b 


S » ly  -b  , . . . ,y  -b  ) ; G (x)  and  H (x)  are  relatively  prime 
11  v v 1 1 


polynomials  in  (J/S) Cx]  where  J - (Z/q) (y  ,...,y  ] such  that 

1 v 


F • G H (mod  q,  S);  and  n ■ the  degree  bound  for  F in 
1 1 


the  non-main  variable. 


n+1 


Output:  G and  H in  (J/S  ) [x]  such  that 

n+1  n+1 


n+1 


F « G H (mod  q,  S ) and  G ^ G , 
n+1  n+1  n+1  l 


— 


40 


H » H (mod  q,  S). 
n+1  1 

(1)  Set  F <—  F(x,y  +b  +b  ),  S <—  (y ),  and 

11  v v 1 v 

m < — 1. 

(2)  Apply  Algorithm  U-2.1  on  G - G and  H - H in  (Z/q)  [x] 

1 1 

to  get  A , B such  that  A G + B H-lin  (Z/q)  (x)  and 
0 0 0 0 

deg  (A  ) < deg(H),  deg(B  ) < deg(G). 

0 0 


(3)  Apply  Algorithm  II-2.2  on  G,  H,  A , B , and  x tor 

0 0 

0 < i < deg(F)  - deg(G)  + deg(H)  and  get  A , B in  (Z/q)  lx) 

i I 


such  that  A G + B H - x (mod  q),  deg(A  ) < deg(H)  for  all 

! : ! • 


i < deg(F),  deg(B  ) < deg (G)  for  all  i < deg(F),  and 


deg(B  ) < deg(G)  if  i - deg(F). 
i 

(4)  Set  0 <-  F - G H,  G’  <~  0,  H’  <~  0. 

(5)  If  m = n + 1,  set  G <—  G",  H <—  H"  and  go  to  (7), 
otherwise,  set  m <—  m + 1.  Set 

D < D - G H’  - G1  H - G'  H',  G < — G",  and  H < — H",  where 

all  arithmetic  operations  are  carried  out  over  Z/q  and  F, 

G,  H considered  in  (Z/q)  (x,y ).  I f D - 0,  then  go  to 

1 v 
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(7),  otherwise,  set  R < — 0 (mod  S ),  i.e.  drop  all  monomials 
in  D with  term  degree  > m and  express  R as 


Lvmi 


y ) x , k < deg(F) 


by  collecting  all  coefficients  of  like  powers  of  x. 


(G)  Set  G 


v H’  v 


G"  < — G + G’ , H"  <—  H + H’,  and  go  to  (S) . 

(7)  Output  G(x,y  -b  ,...,y  -b  ) and  H(x,y  -b -b  ). 

11  v v 11  v v 


II  - 5 Hensel  Construction  over  the  Integers. 

Due  to  its  theoretical  importance  and  for  pedagogical 
interests,  we  now  present  a theorem  which  contains  all  the 
underlying  ideas  of  the  generalized  Hensei’s  construction 
for  extending  factors  over  the  integers.  The  theorem  shows 
that  this  method  of  construction  can  be  carried  out  directly 
in  Z.  The  General  i zed  Hensel  Algorithm  is  simply  a special 
case  of  this  theorem  where,  for  computational  efficiencies, 
the  underlying  numerical  coefficient  space  will  be  Z/q  such 
that  q is  a prime  power, p**r,  greater  than  any  of  the 
integer  coefficients  in  the  original  polynomial,  its 
factors,  and  their  evaluations  at  the  given  v integral 


Lemma  1 1 -5.1:  Let  Jtxl  denote  a general  polynomial 

domain  and  S denote  a general  modulus.  Assume  G H - G'  H' 
k 

in  (J/S  ) [x]  where  G - G’  ; G\  H - H’  - H"  for  some 
relatively  prime  polynomials  G"(x),  H" (x)  in  (J/S)  tx] , 
deg (G)  - deg(G’)  - deg(G"),  deg(H)  - deg(H’)  - deg(H").  The 

k 

G and  G’ . H and  H’  are  associates  in  (J/S  ) lx]  (i.e. 

G » u G'  and  H - v H*  where  u and  v are  units  and 
k 

u v - 1 in  (J/S  ) (x)  or  G/lc(G)  and  H/lc(H),  are  identi- 
cally equal  to  G’/lc(G’)  andHVlc(H’)  respectively  where 

k 

divisions  are  carried  out  in  (J/S  ) tx]  and  the  leading 

k 

coefficients  are  inherently  assumed  to  be  units  in  (J/S  )). 

Proof:  Let  0 ■ gcd(G.G')  so  that  G ■ D C,  G »DC. 
and  gcdlC.C' ) ■ 1.  Thus,  DCH«GH«G  H • 0 C H 
»=>  C divides  H’  C (mod  S)  divides  H’  (mod  S)  - H". 

Clearly  also  C (mod  S)  divides  G (mod  S)  - G".  C (mod  S) , 
thus,  divides  gcd(G",HH)  ■ 1 ••>  deg(C  (mod  S) ) ■ 0.  But 

k 

deg(C)  - deg(C  (mod  S)>  - 0 in  (J/S  ) lx]  and  C must  be  a 
k 

unit  in  (J/S  ) (x) , since  C (mod  S)  divides  1.  Similarly  C* 
is  aloo  a unit.  Therefore,  G and  G’  are  associates  and 
similarly  with  H and  H’.  Since  lc(G)  and  lc(G  ) are  units, 
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we  can  compute  the  ionic  polynomials  G/lc(G)  and  G’/lclG’) 
k 

in  (J/S  ) [x] . These  polynomials  when  considered  in  J[x] 

must  be  equal.  Similarly  H/IcCH)  - H’/lc(H’).  // 

Theorem  1 1 -5. 2s  Let  F(x,y  ,y  ) be  a monic 

1 2 v 

multivariate  polynomial  in  J[X],  J - Z[y y ], 

1 v 

b - lb  ,...,b  I be  a given  set  of  integral  values,  and 

I v 

S - ly  -b  , . . . ,y  -b  I such  that  the  leading  coefficient  of 

II  v v 

F satisfies  ( I c (F) ) ^ 0 or  lc(F)^0  (modS).  Assume  F 


has  a factorization  in  J(x]  such  that  F ■ G H where  G ■ G , 

b 1 

H ■ H and  gcd(G  , H ) ■ 1,  then  there  exists  a positive  n 
b 1 11 

sufficiently  iarge  (in  fact,  n • degree  bound  of  F or  max  td 

i i 

aluays  suffices!  such  that  G-G,H-H,F-G  H over  Z 

n n n n 

G - G and  H - H (mod  S) . 
n 1 n 1 

Proof:  If  we  consider  G and  H to  be  polynomials  in 

1 1 

(Z/Z)  [x]  which  is  an  Euclidean  domain,  then  we  can  apply 

Algorithm  1 1-2.1  and  aet  A (x),  B (x)  in  (Z/Z) (x]  such  that 

1 1 

A G + B H - 1 and  deg (A  ) < deg(H  ),  deg(B  ) < deg(G  ). 

1111  1 1 1 1 
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Similar  to  the  proof  of  Theorem  1 1-2.5,  we  Mill  conetruct 


sequences  of  polynomials  (G  I and  1H  I IN  Jtx)  by  induction 

m m 


such  that  F ■ G H in  (J/S  ) (x)  and 
m m 


G = G , H - H (mod  S) . For  the  inductive  step,  ue  let 
m 1 m 1 


m+1 


R (x,y  ,...,y  ) - F - G H (mod  S ) 
ml  v mm 


/ C (x)  Mm(y  ) uherB  each  C (x)  is  in  Z(x], 

J In  1 v Im 


1 v 


Im 


Im 


For  each  C (x)  Me  apply  Algorithm  1 1-2.2  on  G , H , A , B , 
Im  1111 


and  C as  i f Me  are  in  (Z/Z) Cx]  and  obtain  A , B in 
Im  Im  Im 


(Z/Z)  (x)  such  that  A G + B H ■ C and 
Im  1 (ml  Im 


deg  (A  ) < deg(H  ).  Assume  G and  H ire  put  In  th*  general  i- 
Im  1 


(1)  (n) 

zed  Taylor  series  form  where  G-G  +G  +...+G  and 

1 


(1)  (n) 

H-H+H  +...+H  . But  then 

1 


(m) 


■s 


B’  (x)  Tim (y  ,...,y  ) and 
Im  1 v 


Im 





km  iiiMiiiiiii 


i 


i 


85 


H - j A’  (x)  flm(y  , ...,y  I where  A*  and  B*  are  in 
Im  1 v I m Im 

Im 


Ztxl.  Furthermore,  we  must  have  A’  G + B’  H - C iri 

Im  1 Im  1 Im 

Z lx] , 9i nee  otherwise  GH  when  multiplied  in  their  series 

form  can  not  equal  to  F in  the  integers.  Also  since  F is 

monic,  deg(C  I < deg(R  I < deg(F)  ■ deg(G  ) + deg(H  ) for 

Im  m 11 

all  m > 1,  hence  deg (A’  ) < deg(H  ) and  deg(B’  ) < deg(G  ). 

Im  1 Im  1 

Nou,  consider  A’  and  B’  to  be  polynomials  in  IZ/Z)  tx] . Ue 
Im  Im 

then  have  (A  - A*  ) G - (B*  - B ) H in  (Z/Z)  [x] . Since 

Im  Im  1 Im  Im  1 

G and  H are  relatively  prime,  H must  divide  A - A’  , yet 

11  1 Im  Im 

deg(A  - A’  ) < deg(H  ).  Thus,  A - A’  in  (Z/Z)  [x]  or, 

Im  Im  1 Im  Im 

in  fact,  in  Z(x).  With  this  relation  and  the  fact  that  H 


is  monic,  we  must  also  have  B • B’  in  ZIxJ.  Therefore, 

Im  Im 

(m)  (m) 

the  formula  for  G and  H is  uniquely  determined  by  this 

construction  and  it  '19  routine  to  verify 

(m)  (m)  m+1 

G H - (G+GMH+HJ-F  (mod  S ).  Also 
m+1  m+1  m m 


it  is  clear  that  G ■ G , H - H (mod  S)  for  all  m > 1. 


Xi'rfy-X&i*  v-ri-  mturt.-  .-v.  '»*vi  flmotf  rnr.  «i: 


Uhen  m - n,  the  term  degree  bound  of  F,  any  divisor  of  F must 


be  uniquely  represented,  or  identically  equal  to  an  element 


n+1  n+1 

in  (J/S  ) (xl . Consider,  then,  G H » F - G H (mod  S ), 


Lemma  1 1 -5.1  implies  G and  G , H and  H are  associates  in 


(J/S  ) (xl . Since  F is  monic,  G must  also  be  monic.  But 


G is,  by  construction,  monic  since  G is  monic  and 
n 1 


deg(G  ) < deg(G  ).  Therefore  G » G over  Z.  Similarly 
1 n 


H * H , and  F - G H over  Z.  // 


In  addition  to  demonstrating  feasibility  of  making 


multivariate  Hensel  construction  directly  over  the  integers, 


Theorem  11-5.2  also  shows  that  under  suitable  constraints 


factors  or  divisors  of  polynomials  over  Z can  be  reconstruct- 


ed from  their  evaluations  at  well  chosen  points.  Ue  uill 


now  discuss  extensions  of  the  Zassenhaus’  Algor :thm  and  the 


Generalized  Hensel  Algorithm  so  that  divisors  of  polynomials 


over  Z can  be  reconstructed  using  these  computationally  more 


efficient  "modular"  algorithms. 


The  main  add;  lonal  operation  necessary  for  achieving 


this  is  called  the  Restore  Leading  Coefficient  (RLC)  opera- 


tion (the  reason  for  this  name  will  be  clear  later). 


Theorem  1 1-5.3  (EZ  - Extended  Zassenhaus) 

Let  F(x)  be  a primitive  polynomial  in  Ztx]  and  p be  a prime 
in  Z u'-ich  does  not  divide  the  leading  coefficient  of  Fix). 
Assume  Glx)  in  Ztx]  is  a divisor  of  F,  F ■ G H,  such  that 


G - G (mod  p)  and  H - H (mod  p)  are  relatively  prime  in 
0 0 


(Z/p)  txl.  Then  for  a sufficiently  large  integer  k > 0,  the 


G ( x } and  H (x)  in  (Z/q  ) txl  of  Lemma  1 1-3.2,  uhere  q - p , 
k k k k 


found  by  applying  Algorithm  1 1-3.2  on  F,  G , and  H can  be 

0 0 


used  to  reconstruct  G and  H directly. 


Proof:  By  Lemma  1 1-3.2  we  can  find,  for  any  k > 0,  G (x) 

k 


and  M (x)  in  (Z/q  ) [x]  such  that  F • G H (mod  q ) and 
k k k k k 


G - G , H ■ H (mod  p).  Let  k be  chosen  such  that 
k 0 k 0 


q > 2 B I c (F)  uhere  8 in  Z bounds  the  magnitude  of  coef- 
k 


ficients  of  F and  any  of  its  divisors  uith  the  particular 


degrees  of  G and  H . (One  such  absolute  but  large  bound  B 
0 0 


d 1/2 

can  be  computed  from  F by  the  formula  (2r)  /(3d/2)  uhere 


d « max(deg(G),  deg (H) ) and  r - maximum  magnitude  of  all 
complex  roots  of  F - for  its  computation,  see  [KNU691 • This 
formula  is  derived  by  applying  Sterling’s  approximation  to 


...  J ■ — 


JPWUlttW  i 
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d 

the  maximum  coefficient  of  (x  + r)  .)  For  this  k,  any  divi- 
sor 0 of  F can  at  most  differ  from  the  canonical  represen- 
tative of  its  equivalence  class  in  (Z/q  ) lx]  by  a unit  in 

k 

(Z/q  ) i.e.  0/ I c (0)  (mod  q ) is  unit  normal  rr  ie  the  unique 
k k 

canonical  class  representative. 

By  Lemma  1 1-5.1,  G and  G , H and  H are  associates  in 

k k 

(Z/q  HxJ.  Since  I c (F)  is  a unit  in  (Z/p)  hence  also  in 
k 

(Z/q  ),  so  I c (G)  and  Ic (H)  are  also  units  in  (Z/q  ).  Since 
k k 

I c (F)  - I c (G  ) I c (H  ),  lc(G  ) and  I c (H  ) must  also  be 
k k k k 

units  in  (Z/q  ).  Then,  since  q is  sufficiently  large, 
k k 

H/lc(H)  (mod  q ) - H / I c (H  ) (mod  q ) -«> 
k k k k 

H’  - I c (F ) H/lc(H)  (mod  q ) - H"  - lc(F)  H /lc(H  ) (mod  q ) 

k k k k 

where  the  equality  now  holds  over  Z.  But  I c (F) / I c (H)  - lc(G 

in  Ztx],  hence  I c (G)  also  divides  into  H".  In  fact,  since  F 

is  primitive,  so  must  H be.  Hence,  ue  must  have 

I c (G)  ■ cont(H')  » cont(H")  and 

H - pp(H’)  - pp(H")  - pp(lc(F)  (H  / Ic (H  ))  (mod  q ))  over  Z. 

k k k 


Similarly  G * pp ( I c (F ) (G  /lc(G’i)  (mod  q ))  over  Z. 


Si  Ml-H.-p,,  -MU  JU1I  III 
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(Actually,  once  H has  been  computed,  I c (G)  » lc(F)/lc(H)  ana 

G ■ I c (G)  G / 1 c (G  ) (mod  q ) 
k k k 

- cont(H")  G / I c (G  ) (mod  q ).)  II 

k k k 

Example.  1 1-5.3:  Let  us  now  continue  from  Example 

1 1-3.2,  where  given 

5 4 3 2 

F«x  + 12  x -22x  - 'G3  x + 309  x - 119  and  p ■ 5 ue 
used  the  quadratic  construction  to  compute 
3 2 

G -x  - 15  x • 17  and  H - x + 12  x - 7 from  the  modulo 

2 2 

3 2 

p cofactors  of  F,  G ■ x +2  and  H »x  + 2 x - 2.  Note 

0 0 

that  Example  1 1 -3. 2 gives  us  G H - F (mod  q - G25)  and 

2 2 2 

the  modulus  G25  clearly  exceeds  the  bound  of  the  numerical 

coefficients  in  any  involved  polynomial.  Also,  since  F,  G , 

2 

H are  monic,  there  is  nothing  to  be  done  for  the  res'ore- 

2 

leading-coefficient  operation.  Thus  we  get  directly  from 

the  results  of  Example  11-3.2,  G and  H such  that  F • G H 

2 2 2 2 

over  the  integers.  It  should  be  realized  that  had  F been 

nonmonic,  G and  H obtained  from  the  mod  p factorization 

2 2 

of  F (some  G and  H ) uould  not  be  fetors  of  F over  Z,  so 
0 0 


j ... 
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that  they  must  have  the  Restore  Leading  Coefficient  opera- 
tions performed  on  them.  Since  this  fact  ui  1 1 be  demon- 
strated clearly  in  the  next  example  involving  multivariate 
polynomials,  we  will  not  dwell  on  it  here  but  simply  point 
out  that  so  long  as  the  modulus  is  sufficiently  large,  the 
modular  factors  of  F can  actually  be  vised  to  recover  the 
correct  factors  of  F over  Z regardless  of  whether  F ia  monic 
or  not. 

Algorithm  1 1-5.3:  (Univariate  EZ  Algor i thm5 

1 nput:  F(x)  primitive  in  Z(x],  p prime  in  Z uhich 

does  not  divide  lc(F),  and  G (x),  H (x)  rslati-ely  prime  in 

0 0 

(Z/p)  (xl  such  that  F • G H (mod  p) . 

0 0 

output;  If  F has  divisors  G and  H such  that  F ■ G H 

in  Z and  G = G , H - H (mod  p),  then  G and  H wi  1 1 be  re- 
0 0 

k 

2 

turned.  Otherwise  the  outputs  will  be  G , H , and  q - p 

k k k 

such  that  F - G H (mod  q ) and  q bounds  the  coefficients 
k k k k 

of  F and  any  of  its  divisors  with  degrees  less  than 

max (deg (G  ) , deg(H  ) ) . 

0 0 

(1)  Set  d < — max  (deg  (G  ),  deg(H  )), 

0 0 


I 


1 


i 


I 


i 


l 


I 

i 

| 


r < — maximum  magnitude  of  ail  roots  of  F, 


• V' 


d 1/2 

B <--  (2r)  /(3d/2)  , and 


k <--  minimum  integer  such  that  q > 2 B I c (F) . 

k 


(2)  Apply  Zassenhaus’  Quadratic  Extension  Algorithm  tl  1-3.2) 


on  F,  p,  k,  G , and  H to  obtain  G (x)  and  H (x)  in  (Z/q  ) tx] 
0 0 k k k 


such  that  F - G H (mod  q ) and  G ■ G , H ■ H (mod  p). 
k k k k 0 k 0 


(3)  Restore  Leading  Coefficient! 


Set  H <~  IcT)  (H  / 1 c (H  ))  (mod  q ). 

k k k 


If  H divides  (lc(F)  F)  over  Z,  then  set  H <—  pp(H)  over  Z, 


G <~  cont(H)  (G  / 1 c (G  ))  (mod  q ),  and  return  G and  H. 
k k k 


(4)  Otherwise  return  G , H , and  q . 

k k k 


In  the  multivariate  case,  we  can  similarly  extend  the 


General i zed  Hense I Algorithm. 


Theorem  II-5.4:  (Mu I tivariate  EZ) 


Let  F(x,y  ) be  a multivariate  polynomial  in 

1 v 


(Z (y  ,...,y  I ) (xl  primitive  w.r.t.  (with  respect  to)  x.  Let 
1 v 


p be  a prime  in  Z and  b ■ (b  ,...,b  ) be  a set  of  integral 

1 v 


values  such  that  (lc(F))  /0  (mod  p).  Assume  there  exist 

b r 


G and  H in  (Z ty  , ...,y  ] ) Cxi  such  that  F - G H and 
1 v 


r 
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G - G (mod  q , S),  H - H (mod  q , S)  are  relatively  prime 
1 k 1 k 

in  (Z/q  ) lx]  or  (J/S) IkI  where  J ■ (Z/q  ) ly  , ...,y  1.  For 
k k 1 v 

sufficiently  large  integers  k > 0 and  n > 0,  the  G and  H 

n n 

n 

in  (J/S  ) lx]  found  by  applying  the  Generalized  Hensel  Algo- 
rithm UI-4.1)  on  F,  G , and  H can  be  used  to  reconstruct 

1 1 

G and  H directly. 

Proof:  Let  B in  Z be  larger  than  the  magnitude  of  any 

numerical  coefficients  of  F and  any  of  its  divisors  with  the 

particular  degrees  of  G and  H . (One  such  coefficient  bound 

1 1 

can  be  computed  using  a method  discussed  in  IU8R73J . In 
most  cases,  it  is  sufficient  to  let  B be  the  maximum  of  the 
coefficient  magnitudes  of  F and  the  hound  computed  from  F 

b 

using  the  method  discussed  in  the  proof  of  Theorem  1 1-5.3.) 
Let  k > 0 be  such  that 
2 

q > 2 B > 2 B (coefficient  bound  of  I c (F) ) . 
k 

Let  n*  be  the  term  degree  bound  for  F and  n"  be  the  maximum 
term  degree  of  lc(F).  Now  let  n - n’  + n"  + 1 which  bounds 
all  the  term  degrees  of  lc(F)  F.  Then  for  such  k and  n,  any 
divisor  0 of  F can  at  mo9t  differ  from  the  canonical  repre- 

n 

sentative  of  its  equivalence  class  in  (J/S  ) (x)  by  a unit 


93 


n n 

in  (J/S  ) i.e.  D/lc(D)  (mod  q , S ) is  unit  normal  or  is 

k 

actually  the  canonical  clas9  representative. 

n 

By  Theorem  11-4.1,  ue  can  find  G and  H in  (J/S  ) lx] 

n n 

n 

from  F,  G , and  H such  that  F ■ G H in  (J/S  ) [x]  and 
11  n n 

G - G , H - H in  (J/S)  lx].  By  Lemma  11-5.1,  G and  G , 
n 1 n 1 n 

n 

H and  H are  associates  in  (J/S  ) tx] . Since  lc(F)  is  a 
n 

n 

unit  in  J/S,  hence  also  in  J/S  , 90  must  I c (G)  and  I c (H)  be 


units  in  J/S  . Since  I c (F)  - I c (G  ) I c (H  ),  I c (G  ) and 

n n n 

n 

I c (H  ) must  also  be  units  in  (J/S  ).  Then,  because  q and 
n k 

n 

S is  sufficiently  large,  we  have  H’  • I c (F)  H/lc(H)  in 

n n 

(J/S  ) txl  and  H"  - lc(F)  H / lc(H  ) in  (J/S  ) [xl  identically 

n n 

equal  or  equal  in  (Zly  , ...,y  Dlx).  But  I c (F) / 1 c (H)  » I c (G) 

1 v 

in  (Z(y  ,...,y  1 ) [xl  so  that  the  expression  H"  considered  as 
1 v 


a polynomial  in  (ZIy  ,...,y  Dlxl  must  also  be  divisible  by 
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lc(G).  In  fact,  since  F is  primitive,  so  must  H be,  and 
I c (G)  - cont(H’)  - cont(H"),  H » pp(H’)  >•  pp(H").  Similarly, 


G - ppdc(F)  (G  /!c(G  ))  (mod  q , S )).  (In  fact,  with  H 
n n k 

known  now,  I c (G)  ■ le(F)/lc(H)  ■ cont(HM)  and 

n 

G - cont(H")  (G  / I c (G  ))  (mod  q , S ) as  it  is  considered  as 
n n k 

a polynomial  in  (Zty  y 3 ) [x] . ) // 

1  v 

Example  1 1-5. 4a:  This  one  will  be  the  continuation 

from  Example  1 1-4. la  where  F was  given  to  be 

2 2 3 3 2 

x + (-z  -yz  + y + z + 2)  x + yz  - z + y z-yz  + 2z, 

b - 10,  01.  q - p - 7 and  the  univariate  codivisors  of  F, 

G - x and  H - x + 2.  Since  q - 7,  n - 3 happ.  ?d  to  be 
1 1 

large  enough,  and  F was  given  monic,  there  is  really  nothing 

much  to  continue.  The  results  of  applying  the  Generalized 

Hensel  Algorithm  to  F,  G , and  H are  G ■ x + z - y z and 

1 1 3 

2 3 

H - x + y-  z + 2 such  that  F » G H (mod  q,  S ).  But 

3 3 3 

there  is  no  need  for  performing  the  Restore  Leading  Coeffi- 
cient operations  on  G and  H because  they  are  monic.  Thus, 

3 3 

in  fact  F - G H - G H over  the  integers  with  G » G and 
3 3 3 


F 
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H - H . 

3 

Example  1 1 -5.4b:  Continuing  from  Example  11-4. lb,  ue 

have  the  given  non-monic  multivariate  polynomial 

2 2 2 2 2 3 3 

F-y  x +(-yz  - y z + yz  + y + 2 y)  x + y z - z 

2 

- y z - y z + 2 z ul  th  b • (1,  0) , p - 7,  G • x,  and 

1 

H - x + 3.  The  goal  here  is  to  find  the  factors  over  the 
1 

integers,  G and  H,  corresponding  to  G and  H in  CZ/p)  tx] . 

1 1 

Since  ue  know  that  k - 1 and  n - 4 is  sufficiently  large 

for  the  general  modulus,  the  results  from  Example  1 1 -4. lb, 

by  applying  the  Generalized  Hensel  construction  on  F,  G , 

1 

and  H , are  exactly  uhat  ue  need  here.  Ue  had 
1 

2 

G - (y*  + 2 y’  + 1)  x - (y’  + 1)  y’  z, 

4 

2 2 3 2 

H ■ x + 3 - 2 y'  + 2 y'  -z  - 2 y’  +y’z,  and 
4 

4 

F’  - G H (mod  7,  S’  ),  Ue  uill  nou  apply  the  Restore 
4 4 

Leading  Coefficient  operations  to  G and  H , uorking  with  F' 

4 4 

4 

in  the  more  convenient  modular  space  S’  (rather  than  F in 


S ).  For  simplicity  and  by  symmetry,  ue  first  compute 


G"  - lc(F’)  G / I c (G  ) in  (J/S*  Hxl.  Since 
4 4 


lc(G  ) ■ lc(F')  - (y’  + 2 y’  +1),  ue  have 

4 


4 2 

G"  - G (mod  7,  S’  ) - G - (y*  +2  y’  +J  x - (y*  +1)  y’  z. 
4 4 


Considering  G"  as  polynomials  over  Z (since  7 is  a suffi- 


ciently large  modulus)  ond  computing  the  primitive  part  of  it. 


ue  get  (y ’ + 1)  x - y’  z.  But  then  G can  be  recovered  from 


this  by  the  substitution  of  y - 1 for  y’  in  this  expression 


givingyx-yz  + z-G.  For  recovering  H,  ue  take  the 


corresponding  content  of  G",  y’  + 1,  and  get 


cont(G")  H / 1 c (H  ) (mod  7,  S’  ) 
4 4 


(y*  + 1)  x + 3 (y’  + 1)  - 2 y*  (y*  + 1) 


?.  2 3 2 

+ 2 y’  (y*  + 1)  - z (y’  + 1)  - 2 y’  + y’  z 


(y*  +1)  x +3  +y*  - z - (y*  + 1)  x + (y*  + 1)  + 2 - z 


yx  + y-  z + 2-H.  Then  ue  have  shoun  hou  the 


divisors  of  F over  Z can  be  recovered  from  their  univariate 


images,  G and  H , via  the  application  of  the  Generalized 
1 1 


Hensel  construction,  to  a sufficiently  large  modular  space 
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4 

(in  this  case  S ),  and  ffter  Restore  Leading  Coefficient 
operations  if  necessary.  The  lack  of  resemblance  between  G 

4 

and  G,  H and  H mentioned  in  Example  11-4. lb,  turned  out  not 
4 

to  be  a serious  hindr~  :e  to  the  recovery  of  G and  H.  But 
the  added  complications  and  computations  due  to  non-trivlal 
leading  coefficients  have  also  been  shown  to  be  potentially 
costly.  If  we  compare  the  computations  involved  in  Examples 
1 1 -S . 4a  and  11-5. 4b  as  well  as  in  Examples  11-4. la  and 
11-4. lb,  the  amount  of  extra  work  (for  example  the  substi- 
tution transformations,  the  one  additional  iteration 
necessary  for  the  Generalized  Hensel  construction,  and 
the  Restore  Leading  Coefficient  operation)  due  to  the  simple 

2 

but  non-trivial  leading  coefficient  y*  is  quite  substan- 
tial. This  problem  will  be  the  topic  of  discussion  in  the 
next  section  (II -5 (a) ) . 

Algor i thm  II-5.4:  (Multivariate  EZ  Algorithm) 

Input:  F(x,y ) primitive  in  (Zly  ,...,y  Dtxl, 

1 v 1 v 

b - lb  ,...,o  I a set  of  integers  in  Z such  that 
1 v 

( I c (F ) ) 4 0 (mod  p),  p a lucky  prime  for  F in  Z,  and 

b b 

G (x),  H (x)  relatively  prime  such  that  F - G H in 

11  11 


38 


(J/S) [*). 

Output:  If  F has  divisors  G and  H such  that  F ■ G H 'n 

Z lx, y , . . . , y 3 and  G ■ G , H ■ H in  (J/S) Ik]  , then  u and 
1 v 1 “ 1 

H mil  be  returned.  Otherwise,  the  outputs  wl  1 1 be  G , H , 

n n 


q . and  n such  that  F ■ G H (mod  q , S ). 
k n n k 


(1)  Compute  B as  suggested  in  the  above  Proof. 
Set  L <--  maximum  coefficient  magnitude  of  I c (F) . 


k <—  minimum  integer  such  that  q > 2 B L. 

k 


n’  < — term  degree  bound  of  F ■ max  td  . 

i i 


n"  <—  maximum  term  degree  o'  I c (F) 
n <--  n’  + n"  + 1. 


(2)  Apply  Generalized  Hensel  Algorithm  11-4.1  on  F,  b,  q , 

k 


G , and  H to  obtain  G (x,y  ,...,y  ) and  H (x,y  ,...,y  ) such 

1 1 n 1 v n 1 v 


n 


that  F - G H in  (J/S  ) lx]  and  G • G , H *•  H in 
n n n 1 n 1 


(J/S)  (x) . 

(3)  Restore  Leading  Coefficient: 


n 


Set  H <—  I c (F)  (H  / 1 c(H  ))  (mod  q , S ). 

n n k 


If  H divides  ( I c (F)  F)  over  Z,  then  set 


■ 
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H <--  pp (H)  over  (Zty  I)  lx], 

1 v 

n 

G <—  cont(H)  (G  / 1 c (G  ))  (mod  q , S ), 
n n k 

and  return  G and  H. 

(4)  Other:' i se  return  G , H , q , and  n. 

n n k 

II  - 5 (a)  The  Leading  Coefficient  Problem 

He  non  observe  that  the  basic  underlying  step  of  all  of 
the  Hensel-type  algorithms  is  solving  the  Diophantine  Univa- 
riate Polynomial  Equations  (DUPE)  of  the  form  A G + B H - C 
where  G,  H,  and  C are  given  polynomials  in  (Z/q)  [x] , 
gcd(G,  H)  ■ 1 and  q is  some  pouer  of  a prime  p.  As  in  the 
case  of  integral  Diophantine  equations,  solutions  to  DUPE  are 
not  unique.  Houever,  if  we  impose  the  condition  that  lc(G) 
be  a unit  in  Z/p  and  deg (A)  < deg(H)  then  the  solutions  A and 
B are  unique  up  to  units  in  Z/p.  Unfortunately,  this  artifi- 
cially imposed  condition  influences  the  Hensel-typ'3  of  con- 
struction process  so  that  the  leading  coefficient  of  one  of 
the  factors,  say  H,  is  never  updated  (see  Example  1 1 -4. lb) . 
For  instance,  in  the  Generalized  HenBel  Algorithm,  ue  have 

(1)  (k)  (m) 

H -H  + H +...+H  uhere  each  H i s a li  near 
k+1  1 

combination  of  the  A*  s uhich  are  computed  by  solving  corres- 
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ponding  DUPE’ s.  Since  deg (A  ) < deg(H  ) for  all  Im,  the 

Im  1 

leading  coefficient  of  H is  always  the  same  as  the  luading 

k+1 

coefficient  of  H which  is  only  congruent  to  the  leading  co- 
1 

efficient  of  the  true  factor  H modulo  3 but  usually  not  iden- 
tically equal  to  it  over  the  integers.  Thus,  as  we  have  seen 
in  the  univariate  and  the  multivariate  EZ  Thuorem,  for 

F - G H,  the  Hensel  constructions  result  in  G and  H for 

n n 

sufficiently  large  n such  that  F - G H and  G - G , 

n n n 

H « H , but  usual  ly  G / G and  H j H over  the  integers.  The 
n n n 

reason  for  such  a situation  is  that  essentially  all  of  the 

leading  coefficient  of  F has  been  artificially  forced  onto 

the  leading  coefficient  of  G . In  fact  If  the  leading  coef- 

n 

ficient  of  H were  divided  out  of  H , i.e.  H - H / I c (H  ), 

1 1111 

G ■ I c (H  ) G , and  still  F » G H (mod  S) , then  the  Hensel 

111  11 

construction  using  these  G and  H will  automatically  force 

1 1 


I c (G  ) » I c (F)  and  lc(H  ) ■ 1.  In  this  case,  the  Restore 
n n 

Leading  Coefficient  operation  becomes  simply  G’  ■ pp (G  ) and 

n 

H’  - cont(G  I H (mod  S ).  Thus,  in  general,  we  have 
n n n 


G ■ G /u  where  u - lc(H)  which  is  a unit  in  J/S  , and 
n 

F - G H - ( (1/u)  G ) (u  H ) where  multiplications  by  units 
n n 

n 

are  done  in  J/S  . Many  such  phenomena  (in  their  simplified 
versions)  can  be  seen  from  Example  11-5. 4b  and  its  proceed- 
ing related  examples.  Ue  now  point  out  that  If 


d d d 

u - 1 + y + y + ...  + y (mod  y ,y  , ...,y  ) then  1/u  is  a 
12  v 1 2 v 


dense  polynomial  in  the  y’s  of  degree  less  than  d In  each, 

v 

which  is  a polynomial  of  d terms.  The  relationships  of  G, 
n 

H,  G , and  H in  J/S  are  G - u G and  H • (1/u)  H.  This 
n n n n 

means  that  the  Hensel  method  is  reconstructing  the  factors  G 

i 

and  H where  each  coefficient  of  G is  multiplied  by  u and 
n 

each  coefficient  of  H is  multiplied  by  (1/u).  This  's  an 
expression  blouup  due  to  non-uniqueness  of  solutions  to 
DUPEs  relating  essentially  to  the  leading  coefficient  of  F. 
Ue  will  address  this  problem  as  the  Leading  Coefficient 
Blowup  (LCB)  problem. 

Referring  back  to  Theorem  11-5.2  where  we  assumed  F to 

be  monic,  we  noted  in  the  proof  of  that  theorem  that 

deg(C  ) < deg(R  ) < deg(F)  ■ deg(G  ) + deg(H  ).  Uith  this 
Im  m 11 


i.  ,w 
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condition  holding,  Corollary  11-2.2  implies  that  the  corres- 
ponding A and  B satisfy  deg(A  ) < deg(H  ) and 
Im  1 m 1m  1 

(■) 

deg(B  ) < deg(G  ).  Thus,  deg(G  ) < deg(G  ) and 
I m 1 1 

On) 

deg(H  ) < deg(H  ) so  that  !c(G  ) - lc(G  ),  lc(H  ) - lc(H  ), 
1 n 1 n i 

and,  in  fact,  lc(G)  - lc(G  ) *>  lc(H)  ■ lc(H  ) ■ lc(F)  » 1. 

n n 

The  correct  leading  coefficients  for  G and  H made  G and  H 

n n n n 

the  unique  canonical  representative  of  the  equivalence  class- 

n 

es  for  G and  H in  (J/S  ) lx]  so  that  G - G and  H - H over 

n n 

the  integers  when  n is  sufficiently  large  to  bound  the  term 

degrees  in  F.  G,  and  H.  Ue  now  observe  that  if  F is  monic, 

then  the  same  arguments  hold  even  if  he  Integral  arithmetic 

uas  carried  out  in  Z/q  for  k sufficiently  large  to  bound 

k 

all  the  numerical  coefficients  of  F,  G,  and  H (as  in  Theorem 
1 I-S.3  and  11-5.4).  Examples  11-4. la  and  11-5. 4a  clearly 
demonstrate  these  points.  Ue  further  note  that  the  impor- 
tant condition  is  not  that  F is  monic  but  rather  that 

1 c (G)  - I c (G  ) and  lc(H)  ■ lc(H  ).  (See  Remark  following 
n n 

Theorem  11-4.1.)  This  leads  us  to  the  next  lemma  which, 
though  only  constitutes  an  artificial  solution  to  the  LCB 


103 


problem,  uill  later  prove  to  avoid  the  LCB  problem  at  a 
3mall  cost  for  some  Applications  of  the  Hensel  construction. 
Houever,  ue  uill  emphasize  that  this  method  of  treating  the 
leading  coefficients  is  indeed  "sparseness-preservlng". 

For  univariate  polynomials  a similar  version  of  the 
follouing  lemma  applies.  Since  ue  have  pointed  out  the 
effects  uhen  F is  monic  in  the  remark  follouing  Lemma  II- 
3.2,  ue  uiN  nou  concentrate  only  on  the  multivariate  case. 
In  fact,  the  Restore  Leading  Coefficient  operation  for 
univariate  polynomials  essentially  only  involves  the 
additional  step  of  computing  integral  contents  and  primitive 
parts  that  it  is  a much  less  expensive  operation  than  in  the 
multivariate  case. 

Lg 1111113  II-5.5:  Let  F(x,y  ,...,y  ) be  in 

1 v 

(Zty y 1 1 (xj  and  primitive  u.r.t.  x.  Let 

1 v 

b ■ ,b  b 1 be  a set  of  integral  values  and  p be  a lucky 

1 v 

prime  for  F in  Z such  that  (lc(FI)  - 0 (mod  p) , Assume 
b b 

there  exist  G and  H in  (Z(y  ,...,y  IHx]  such  that  F - G H 

1 v 

and  G ■ G (mod  q , S),  H ■ H (mod  q , S)  are  relatively 
1 k 1 k 

prime  in  (Z/q  Hxl  or  (J/S)  (xj  uhere  J - (Z/q  Hy ]. 

k k 1 v 

Then,  with  F,  G , H , I c (G) , and  lc(H),  ue  can  apply  a modi- 


i 


wm 


104 


fied  version  of  the  Generalized  Hen9el  Algorithm  30  that  the 
outputs  of  it  are  G and  H. 

Proof:  Before  we  proceed  with  the  proof,  we  define  a 
replacement  function,  the  U9e  of  which  uas  f. i r 3 1 suggested 
by  Mores,  replacelctP.  L)  - P with  its  leading  coefficient 
replaced  by  L.  As  in  proof  of  Theorem  1 1-4.1,  we  have 

i 

F « G H (mod  q , S)  so  that  A and  B can  be  found  in 

II  k 11 

(J/S)  (xl  9uch  that  A G + B H ■ 1 and  deg(A  ) < deg(H  ), 

1111  1 1 

deg(B  ) < deg(G  ).  Now  let  G’  - replacelctG  , I c (G) ) and 

III  1 


H*  » replacelc(H  , IclH)).  For  the  mth  step  of  the  indue 
1 1 

(1)  (IB-1) 

tive  process,  we  use  G’  - G'  + G + ...  + G and 

ffl  1 

(1)  (m-1) 

H*  - H’  + H + ...  + H to  compute 
m 1 


#+! 

R - F - G’  H’  (mod  q , S ).  Thus  we  have 
m mm  k 

deg(C  ) < deg(R  ) < deg(F)  - deglG  ) + deglH  ) since  non 
I m Ki  11 

lc(G’)  lc(H’)  ■ I c (F)  over  *he  integers.  However,  9ince 
m m 

G*  ■ G and  H’  in  J/S.  we  can  still  apply  Algor i thm 
1111 

1 1-2.2  on  G , H , A , B , and  C in  J/S  to  compute  A and 
1111  Im  im 
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B 9uch  that  A G + B H - C . But  now  Coro' I ary  2.2 
In  tin  1 1ml  In 


implies  deg(A  ) < deg(H  ) and  deg(B  ) < deg(G  ).  Hence 
Im  1 Im  1 


(m)  (m) 

deg(G  ) < deq (G  ),  deg(H  ) < deg(H  ) and 
1 1 


I c (G*  ) - lc(G’)  - ic(G’)  - I c (G) , I c (H*  ) - lc(H).  There- 

m+1  m 1 m+1 


fore,  for  sufficiently  large  k and  n (see  proof  of  Theorem 


1 1-5.4),  we  have  F - G’  H’  (mod  q , S ).  But  because  of 

n n k 


the  identities  in  the  leading  coefficients,  we  must  have 


G ■ G’  and  H * H*  over  the  integers.  II 
n n 


Example  11-5.5:  To  demonstrate  the  feasibility  and 
the  necessary  additional  operations  of  this  modified  Hensel 
construction,  we  work  out  a problem  which  is  closely  related 


to  that  of  Example  1 1 -5. Ab.  Let  F be  the  product  of  y and 
the  given  polynomial  of  Example  1 1-5. 4b,  so 


42  2 2 4 3 4 3 33 

F-y  x +(-y  z -y  z + y z + y + 2y)x  + y z 


2 3 4 3 2 

-y  z -y  z-y  z + 2y  z. 


Then,  over  the  integers,  F has  codivisors  G and  H where 


2 2 

G ■ y(y  x + z - y z)  ■ y x + yz-y  z and 


2 2 2 2 

H - y(y  x + y-  z 2)  - y x + y - yz  + 2y. 


- - ■ 


Uith  b - 11.  0)  and  p - 7,  G and  H are  the  same  as  those 

1 1 

in  Example  11-5.41;,  where  G - x and  « x + 3.  Here,  we 

2 2 

assume  it  is  known  that  lc(G)  ■ y and  lc(H)  ■ y are  the 
correct  multivariate  leading  coefficients  of  and  H^.  So 

we  apply  the  replacement  function  to  get 

2 

G’  - replacelclG  , lc(Gl)  ■ y x and 
1 1 

2 

H’  - replacelclH  . lc(Hl)  - y x + 3.  As  in  Example 
1 1 

II -5. 4b,  ue  get  the  same  A , B , A , B , A , and  B 

00  00  01  01  02  02 

i 

such  that  A G + B H - x (mod  7)  for  1-0.  1.  2. 

0i  1 01  1 

Another  preparatory  step  for  the  Hensel  construction  Is, 
as  before,  the  substitution  transformation,  y’  + 1 for  y. 

In  this  case,  we  apply  this  transformation  to  G* , H\  and  F 


to  get  G - G'  (y-y’+l,  z)  ■ ly'  + 2 y +1) 
1 1 


First  we  compute  0 - F'  - G’  H’  and 
1 1 1 


R * D (mod  7,  S’  ).  Ue  get  R • -3  y’  x » C lx)  y' , 
11  11 


Thu9,  simply  A (x)  ■ -3,  B (x)  - 0 such  that 
1 1 


(1) 


A G + B H ■ C (mod  7)  so  that  G » B y’  ■ 0, 
11111  1 


(1) 


H - A y’  - -3  y\ 
1 


(1)  2 

G'  - G*  + G * G’  » (y'  + 2 y'  + 1)  x,  and 
2 1 1 


(1)  2 

H’  » H’  + H - (y*  + 2 y'  + II  x + 3 - 3 y'  such  that 
2 1 


G’  H’  - F*  (mod  7,  S’  ).  Next  we  compute 
2 2 


(1)  (1)  (1)  (1) 

D - D - G’  H - G H’  - G H and 
2 11  1 


3 2 .2 

R - D (mod  7,  S’  ) - (y’  - y’  z - z ) x - 3 y’  z 
2 2 


2 2 
(x)  y*  + (-x  - 3)  y’  z + (-x)  z 


2 2 

- C (x)  y’  + c (x)  y’  z + C (x)  z . By  scalar  linear 
11  12  22 


combinations  using  A , B i - 0,  1,  2,  we  get  A - 1, 
0i  0i  11 


i 


I . 0,  A - 0,  B ■ -1,  A ■ -1,  and  B -0.  Thus 

II  12  12  22  22 

(2)  2 2 
; - B y’  + B y’  z + B z - -y*  z and 

11  12  22 

(2)  2 2 2 2 
^ .A  y*  + A z - y*  - z , so  that 

11  22 

(2)  2 

G’  - G’  + G ■ (y*  + 2 y*  + 1)  x - y*  z, 

3 2 

(2)  2 22 

H*  - H*  + H - (y*  + 2 y*  + 1)  x + 3 - 3 y’  + «’  - Z 

3 2 

3 

and  G*  H*  - F*  (mod  7,  S’  ).  Finally,  mb  cdmpute 
3 3 

(2)  (2)  (2)  (2) 

D - D - G’  H - G H*  - G H and 
3 2 2 2 

4 2 2 2 

R - D (mod  7,  S’  ) - (-yf  z - y’  z ) x - 3 y*  z 
3 3 

2 2 
. ( — x -3)  y’  z + (-x)  y*  z 


2 2 

* C (x)  y’  z + C (x)  y’  z . it  is  easily  computed 
112  122 


that  A * 

0,  B ■ - 

■1, 

A - -1, 

and  B 

- 0. 

Thus 

112 

112 

122 

122 

(3) 

2 

2 

(3) 

G ■ B 

y’  z ■ -y' 

z and  H 

-A  y* 

z - 

-y’  z , 

112 

122 

(3) 

2 

2 

so  that  G’ 

4’  +G 

■ 

(y*  + 2 y* 

+ 1)  X - 

y*  z 

- y*  z 

4 3 


(3)  2 2 2 2 

H’  - H’  + H - (y*  +2  y’  +1)  x + 3 -3  y’  + y’  z -y’  z , 
4 3 


such  that  G’  H’  - F (mod  7,  S’  ).  In  .act, 

4 4 

0 - F - G’  H’  - 0,  i.e.  F*  - G'  H’  over  Z,  eo  the  Hensel 
4 4 4 4 4 

construction  terminates  here  and  gives 

2 2 

G - G’  (y’-y-l,  z)  - y x + yz-y  z and 
4 

2 2 2 

H ■ H’  (y’-y-l,  z)«y  x + y - y z + 2yas  results, 

4 

which  is  indeed  the  correct  codivisors  of  F over  Z.  Note 

that  only  three  iterations  of  the  Hensel  construction  were 

needed  in  this  case,  where  the  term  degree  bound  for  F is 

actually  G,  so  that  even  though  the  problem  is  a larger  one 

than  Example  1 1 -5. 4b,  the  number  of  iterations  remain  the 

same.  Therefore,  we  have  shown  with  this  example  that 

knowing  the  correct  multivariate  leading  coefficients  avoids 

the  LCB  problem  (Compare  the  number  of  terms  in  H’  with  that 

4 

of  H in  Example  1 1 —5. 4b. ) at  the  co3t  of  working  with  a 
4 

larger  given  polynomial  but  without  having  to  go  through  any 
more  iterations  of  the  Hensel  construction  than  before. 

This  is  definitely  an  improvement  over  the  regular  Hensel 
construction  so  long  as  we  can  be  given  the  Xnowledge  of  the 


lie 


correct  way  of  splitting  the  leading  coefficient  of  the 
given  polynomial  into  the  leading  coefficients  of  i ts  uni- 
variate codivisors.  In  Chapter  IV,  we  will  see  how  this 
Knowledge  can  actually  be  obtained  at  a small  cost  thereby 
avoiding  large  cost  due  to  the  LCB  problem  and  preserve  the 
sparsenes9  of  the  polynomials  during  the  Hensel  construction. 
Algor i thm  1 1 -5 . 5 1 

Input:  Same  as  those  for  Multivariate  EZ  Algorithm  II-5.4 


except,  in  addition,  g and  h in  ZIy  , ...,y  1 are  given  such 

1 v 


that  g - I c (G  ) and  h ■ lc(H  ). 
b 1 b 1 


Output:  If  F has  divisors  G and  H such  that  F ■ G H and 

lc(G)  - g,  I c (H)  - h,  then  G and  H wi 1 1 be  returned.  Other- 


wise the  outputs  will  be  G , H , q and  n such  that 

n n k 


F - G H (mod  a , S ). 
n n k 


(1)  Compute  bounds  k and  n as  in  Step  (1)  of  Algorithm 
1 1-5.4. 

(2)  Return  the  result  of  applying  GHA  on  F, 


G’  - replacelc(G  , g),  and  H’  - replace(H  , h). 
1 111 
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CHAPTER  III  POLYNOMIAL  GREATEST  COMMON  DIVISORS 
III  - 1 I ntroduc t i on 

As  already  noted,  polynomial  GCO  computations  basically 
rely  on  division,  especially  for  the  PRS  GCD  algorithms. 

But  the  process  of  finding  the  GCD  of  two  polynomials,  F and 
G,  also  has  many  features  in  common  with  polynomial 
factorization.  Clearly,  if  ue  have  the  complete 
factorizations  of  F and  G,  then  their  GCD  can  simply  be 
recognized  by  taking  the  common  factors  raised  to  the 
respective  minimum  pouers.  Even  uithout  the  complete 
factorizations,  though,  the  GCO,  D,  of  F and  G still  breaks 
up  F and  G into  incomplete  factorizations,  F - D (F/D)  and  G 
= D (G/D).  It  is  these  factorizations  that  we  usually  seek 
in  GCO  computat .ons,  uhere  ue  often  require  not  only  the 
GCD,  but  also  the  cofactors,  F/D  and  G/D  (for  example,  in 
reducing  F/G  to  lowest  terms).  The  Hensel-type  GCD 
algorithm  will  always  compute  the  cofactors  as  a byproduct 
of  the  built-in  trial  division  process,  just  as  Broun’s 
Modular  GCD  Algorithm  will  obtain  them  as  a byproduct  of  the 
process  of  verifying  the  GCD. 

The  Hensel-type  GCD  algorithm  uses  the  modular  and 
evaluation  homomorphisms,  just  as  the  Modular  GCD  Algorithm 
does.  The  main  distinction  is  in  the  process  of  inverting 
the  mappings.  Here,  ue  use  the  Hensel-type  constructions 


which  make  good  use  of  one  well  chosen  homomorphic  image 
instead  of  several  images  required  by  the  Chinese  Remainder 
Algorithm  and  the  interpolation  process.  This  i9  actually 
the  only  distinction  in  the  univariate  case.  For 
multivariate  GCD  computations,  however,  the  Generalized 
Hensel  Algorithm  enables  the  simultaneous  constructions  (or 
inversions)  in  several  variables  instead  of  recursively 
inverting  one  variable  at  a time.  This  is  why  the 
multivariate  EZGCD  Algorithm  often  compares  much  more 
favorably  to  other  GCO  algorithms  than  the  univariate  UNIGCD 
Algorithm  would.  For  this  reason  (and  because  the 
computation  of  univariate  GCDs  are  not  so  costly,  anyway) 
the  univariate  case  will  be  given  less  emphasis  in  the 
following  di scussion. 

Ill  - 2 Basic  Concepts  and  an  Overview  of  EZGCD  Algorithm 
Ue  will  now  present  the  four  basic  steps  of  the  GCD 
algorithm  by  Hensel-type  constructions.  This  overview  will 
shou  the  similarities  of  this  method  with  the  Modular  GCD 
Algorithm  and  with  Uang  and  Rothschild's  polynomial 
factorization  algorithm.  Then,  we  will  also  prove  a 
fundamental  theorem  which  will  9how  why  this  new  method  for 
GCD  computation  is  possible.  For  both  of  these  tasks  we 
will  mainly  discuss  the  more  important  multivr-riate  case  and 
only  point  out  that  the  univariate  case  follows  similarly 
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and  more  3 i mp I y so  that  it  need  not  be  repeated  here. 

First,  we  prove  a result  that  is  essential  fo~  the 
understanding  of  GCO  computations  via  homomorphic  images  in 
more  structured  subdomains.  This  result  is  implicitly 
stated  in  Brown' 9 I0RO71]  discussions  on  "unlucky"  primes 
and  b-values.  and  its  proof  can  be  implicitly  derived  from 
his  Theorem  1 and  4 if  one  understands  the  theory  of 
polynomial  FRS’s  and  subresul tants.  But  its  importance  in 
our  ensuing  discussions  warrants  its  individual  treatment 
here. 

^enima  1 1 1-2.1:  Let  F and  G be  in  J(x]  where  J is  any 

u.f.d.  Let  h be  a homomorphism  from  J to  a subdomain  of  J 
whose  kernel  is  some  ideal  I of  J,  i.e.  h:  J — > J'  . J/I. 
Assume  h(lclF))  I 0 and  h( lc (G) ) .<  0.  If  0 - gcd(F,  G)  and 
D - gcdlh(F),  h (G) ) uhere  h(F)  means  applying  h on  all 
coefficients  of  F,  then  deg (0 Cx) ) < deg(O’lx)). 

Proof:  Since  h is  a homomorphism,  0 divides  F and  G so  that 

there  exists  F’  and  G’  in  J such  that  F . 0 F'  and  G - D G’ 
— > h(F)  - h(D)  h(F')  and  h (G)  - h(0)  h(G’)  or  h(D)  divides 
h IF)  and  h(G)  -=>  h (0)  divides  O’,  by  definition  of  GCD  — > 
deg { h (0 ) ) < deg (0* (x) ) . But  hCIcCD))  also  - 0.  for  if 
h ( I c (D) ) = 0 then  since  I c (D)  divides  lc(F)  and  I c CG)  so 
I c (F ) - lc(G)  » 0,  contradict ing  assumpt ions.  Therefore 
deg(Dlx) ) - deg (h(D) ) < deg(0’ (x)>.  II 


. ...  ....  1-  , 


In  our  applications,  uo  have  the  modular  homomorphi sms 

determined  by  a prime  p and  the  evaluation  homomorphi sms 

determined  by  a chosen  set  of  integers  b (b  , ...,b  I. 

1 v 

According  to  Broun,  the  particular  chosen  p or  b is  lucku 
if  the  homomorphism  determined  by  it  satisfies 
deglh(D))  > deg(0’  - gcd(h(F),  h (G) ) ) , and  un lucku  other- 
uise.  Broun’ s Theorem  1 and  4 in  IBRD7U  give  another  ver- 
sion of  the  definition  of  the  "luckiness"  in  terms  of  sub- 
resultants  of  tuo  polynomials.  In  general,  ue  ui I I call  a 
particular  homomorphism  lucky  for  a single  polynomial  P if 
the  homomorphic  images  of  any  tuo  distinct  factors  of  P are 
not  the  sai'.e  in  the  image  domain. 

ihe  EZGCD  Algorithm  uill  compute  the  GCD, 

0'(x,y  y ),  of  tuo  multivariate  polynomials 

l v 

FMx.y y I and  G’(x,y  ,...,y  ) in  Zlx.y y ] uith 

1 v 1 v 1 v 

the  cofactors  F’/D’  and  G’/O’  as  byproducts  of  the  GCD  veri- 

f icat ion  process. 

Step  I:  (Contents  and  Primitive  Parts) 

Consider  F*  and  G'  as  polynomials  in  (Zty y 1 ) 1x3 

1 v 

and  set  f’  <--  cont(F'),  F <--  pp(F’),  g’  <—  cont(G’), 
G <--  pp(G’),  and  d’  <--  gedff’,  g’). 

The  computation  of  content  and  primitive  part  i9  a recursive 
process,  each  step  involving  calculations  of  GCD’s  of 


polynomials  with  one  less  variable.  As  we  will  see  later, 
these  computations  can  be  accomplished  using  a variant  of 
^his  algorithm.  The  remaining  task  for  this  algorithm  is  to 
compute  D « gcd(F,  G),  uhere  F and  G are  now  primitive,  and 
the  corresponding  co factors  F/Q  and  G/D.  Then  D*  is  simply 
d’  D and  the  corresponding  cofactors  are  (f’/d')  (F/D)  and 
(g’/d*)  (G/D). 

Step  II:  (Evaluation  and  Univariate  GCD) 

Choose  a set  of  v integers  b - (b fj  1 such  that 

1 v 

the  degrees  in  x of  F and  G evaluated  at  y - b , 

i i 

i-1 v are  not  decreased,  [Such  an  evaluation  is 

called  a valid  evaluation.) 

Compute  F (x),  G (x),  and  0 (x)  ■ gr;d(F  , G ). 
b b 0 b b 

This  univariate  GCD  computation  can  be  done  by  using,  say, 

the  Modular  GCD  Algorithm  uhich  is  very  efficient  in  this 

case.  Houever,  it  is  equally  efficient  to  use  the  UNIGCD 

Algorithm  with  Zassenhaus’  quadratic  extension  Algorithm  in 

most  cases,  as  ue  will  see  in  the  next  section. 

Step  III:  (Preparation  for  Hensel  Construction) 

If  deg(D  ( x ) ) - 0,  then  D (x)  ■ 1 and  D - 1. 

0 0 


1 


If  deg(D  ) - deg(F  ) or  deg(G  ),  then  F divides  G,  or  G 
0 b b 


divides  F,  or  a new  valid  evaluation  should  be  made, 


.!c*.  r.'.. 
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Clearly  if  deg(0  ) - 0.  then  D - constant.  Since  b is  a 

0 0 

valid  evaluation  for  F and  G,  it  must  also  be  valid  for  D 

and  so  D - constant  implies  0 contains  no  x.  But  F and  G 
0 

are  primitive  implies  0 is  also  primitive.  Hence  0 must  be 

1.  Because  deg(0  ) > deg (0)  (see  Lemma  1 1 1-2.1), 

0 

if  deg(D  ) - deg(F)  < deg(G)  and  F does  not  divide 
0 

G.  ue  Know  deg(D)  < deg(0  ) so  that  this  evalua  icr: 

0 

must  be  unlucky  so  as  to  create  an  univariate  GCO  which  is 

not  the  image  of  the  actual  GCO.  Similarly, 

deg (0  ) = deg (G)  < deg(F) . 

Pi 

Otherwise  determine  if  the  following  condition  holds: 

Condition  1 1 1 -A:  gcdlO  , F /0  ) - 1 or 

0 b 0 

gcd(0  , G /0  ) - 1. 

0 b 0 

This  is  a necessary  condition  for  the  application  of  the 
Generalized  Hensel  Algorithm,  as  we  have  seen.  This  condi 
tion  will  not  hold  if  gcd(0,  F/D)  / 1 and  gcd(D,  G/D)  / 1. 
(The  simplest  class  of  problems  for  which  this  is  true  is 

2 2 

the  case  where  F « U and  G * U V,  U and  V are  any  non- 
trivial polynomials  depentent  on  the  main  variable  of  the 
problem  in  question).  If  this  is  the  case,  the  algorithm 
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wiii  be  forced  to  apply  a special  case  method  which  amounts 
to  computing  a square-free  decomposition  of  D.  Houever, 
this  special  method  also  uses  the  Hensel-type  constructions 
and.  in  many  cases,  proves  to  be  comparable  in  cost  'o,  if 
not  more  efficient  than,  the  other  GCD  algorithms.  This 
special  method  will  be  discussed  and  especially  clearified 
when  we  present  the  Hensel-type  square  free  decomposition 
algorithm  in  Chapter  VII. 

Assuming  Condition  111 —A  holds,  we  continue  under  a further 
assumption. 

Condition  111-8:  deg(0  ) - deg(D). 

0 

Since  D is  supposedly  unknown,  this  assumption  cannot  be 
tested  at  this  point.  However,  because  it  is  an  essential 
condition  for  the  algorithm  to  work  (see  the  following 
theorem),  at  various  places  in  the  algorithm  we  endeavor  to 
increase  the  probability  'if  having  it  hold.  He  also  use 
a "safety-valve"  test  near  the  end  of  the  algorithm,  to 
ensure  that  any  unlucky  evaluation,  uhich  causes  Condition 
1 1 1 -B  to  t’  false,  will  be  detected. 

Step  IV;  (Application  of  the  Hensel  Construction) 

Suppose  F satisfies  condition  III -A. 

Apply  the  Hultivariate  EZ  Algorithm  (11-5.4)  on  F, 

0 (x),  and  F (x)/D  (x)  to  get  either 
0 b 0 
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(a)  multivariate  coractors,  D"  and  H",  of  F such  that 
F - 0"  H"  over  Z,  or 

(b)  some  q , n,  and  D , H such  that 

k n n 

n 

F * 0 H (mod  q , S i . 
n n k 

He  note  here  that  only  F (or  G)  is  used  in  the  application 
of  the  Hensel-type  construction.  This  will  be  the  funda- 
mental observation  for  the  new  and,  in  most  cases,  improved 
Hensel-type  polynomial  content  and  primitive  part  algorithm 
which  we  will  present  later. 

Step  V:  (GCD  Verification) 

For  case  (a),  test  uhether  0"  also  divide  G.  I f so 
D * 0"  is  the  GCD  of  F and  G we  seek,  so  that 
0’  = d’  D and  the  cofactors  can  be  computed  by  multi- 
plying quantities  already  computed.  Otherwise,  or  for 
case  (b),  go  back  to  Step  li  for  a neu  evaluation  and  a 
resulting  univariate  GCD  having  a smaller  degree  than 

this  0 . 

0 

From  the  properties  of  the  Multivariate  EZ  Algorithm 

(Theorem  II-5.4),  0"(x)  • 0 (x)  and  D"  is  a proper  primitive 
b 0 

divisor  of  F.  Thus,  s'nce  deg(0")  - deglO  ) > deglD), 

0 

0"  must  be  the  GCD,  0,  of  F and  G if  it  also  divides  G. 


mm 


I 


This  will  oe  made  more  explicit  in  Theorem  111-2.3. 

For  case  (L).  since  q and  n are  chosen  to  be  surely  larger 

k 

than  the  coefficient  bound  and  the  term  degree  bound  of  F 
respectively,  b must  be  an  unlucky  evaluation  so  we  look  for 
another.  In  case  it  was  decided  to  use  a heuristic  coeffi- 
cient bound  for  the  computation  of  q in  the  multivariate  EZ 

k 

algorithm,  then  case  (b)  does  not  necessarily  mean  that  the 

chosen  evaluation  was  unlucky.  It  may  be  worthwhile  to 

apply  the  Hensel  construction  again  with  a larger  integer 

modulus  than  q (perhips,  an  actual  coefficient  bound), 
k 

However,  for  practical  multivariate  problems,  it  is  rarely 
the  case  that  any  reasonable  heuristic  coefficient  bound 
such  as  the  one  suggested  in  the  EZ  algorithm  will  not 


suffice. 

Example  111-2.1:  As  an  example  of  multivariate  poly- 
nomial GCO  computation  using  the  above  out',  ined  EZGCD  Algo- 
rithm, we  assume 

2 2 3 3 2 

F’=x  +(-z  -yz  + y + z + 2)x  + yz  - z + y z 

- y z + 2 z and 

3 2 2.  3 3 

G’  - x +(-z  -yz  +z)x  + (y  z - z - y + 2)x 


+ y z - 3 y z + 2 z are  given  and  we  want  to  find 
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O'  « gcd(F’,  G’).  Since  F and  G are  ironic,  they  must  be 
primitive,  so  that  Step  1 can  be  passed  through  with 
d*  - gcd(f\  g* ) - gcd(l,  1)  - 1,  F - F*  and  G - G\  Ue 
choose  b » (0,  01  which  is  clearly  a valid  evaluation  for 

2 

monic  polynomials  F and  G.  Then  F (x)  - x + 2 x, 

b 

3 

G ( x ) = x + 2 x,  and  0 ■ gcd(F  , G ) ■ x.  Since 
b 0 b b 

deg(D  lx))  - 1 / deg(F)  or  deg(G),  we  test  Condition  III -A 

0 

and  find  gcd(0  , F /Q  ) ■ gcd(x,  x+2)  ■ 1.  So  we  continue 
0 b 0 

to  Step  III  assuming  Condition  II1-B,  deglD  ) - 1 - deg(D) 

0 

where  0 » gcd(F,  G)  is  to  be  found.  Ue  now  apply  the  Mul- 
tivariate EZ  Algorithm  (11-5.4)  on  F,  D (x)  - x,  and 

0 

F (x)/D  (x)  « x + 2.  But  this  problem  was  already  done  in 
b 0 

Example  1 1-5. 4a  and  the  results  are  D"  ■ x + z - y z and 

n 

L. 

H"=x+y-z  +2  such  that  F - 0"  H"  over  Z (case  (a)  of 
Step  IV).  Since  0"  also  divides  G 
2 2 

(G  » 0"  (x  - z x - y + 2)  over  Z) , ue  conclude  that 
O'  » 0 = 0"  » gcd(F,  G)  ■ gcd(F’,  G’)  and  indeed  it  is. 


Ue  will  now  prove  the  fundamental  theorems  for  the 
validity  of  the  application  of  the  Hensel-type  construction 
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to  polynomial  GCO  computations.  Again,  in  order  to  avoid 

redundancy,  we  concentrate  on  the  multivariate  case. 

Let  F(x,y  ,...,y  ) and  G(x,y  ,...,y  ) be  two  multiva- 
1 v 1 v 

riate  polynomials  over  the  integers  with  GCD  Dlx.y  y 


). 


1 v 


For  a set  of  integral  values  b ■ (b  ,...,b  ) such  that 

1 v 

deg (F  ( x ) ) * deg(F  (x()  and  deg(G(x)I  - deg(G  (x))  fi.e. 
b b 

1 1 c (F) ) / 0 and  ( I c (G) ) 4 0.1,  let 

b b 

D (x)  ■ gcd(F  (x) , G (x) ) . 

0 b b 

Theorem  I II -2. 2: 

If  D (x)  is  assumed  to  have  the  following  properties: 


(i)  deg(D  (x))  « deg (D (x) ) , 

0 


(ii)  gcd(D  (x)),  F (x)/D  (x))  « 1, 

0 b 0 

then  for  any  primitive  divisor  0’ (x.y  ,...,y  ) of  F such 

1 v 

that  O’ (x)  =0  (x) , we  must  have 
b 0 

CHx.y  ,.,.,y  ) = O' (x,y  ,...,y  ) Q(y  ,...,y  ) uhere  Q is  a 
1 v 1 v 1 v 

polynomial  independent  of  x. 

Proof.  Refer ing  to  Lemma  1 1 1-2.1  and  its  proof,  we  have 


0 (x)  dividing  D (x)  -D’(x),  deg  (0)  - degID  (x) ) , and. 
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similarly,  deg  ID’)  * deg(Q* (x)).  By  (i),  we  have 

b 

deg(D  M)  - deg(D)  - degID  lx))  - deg(D*  (x))  - deg(D’). 
b 0 b 

Let  F » D F'  and  G - D G\  then 

D (x)  - D (x)  gcd(FMx).GMx)),  and  so  gcd(F\  G’ ) - c,  c a 
0 b b b b b 

constant.  That  is  0 (x)  ■ O' (x)  * c D (x).  Non  let 
0 b b 

p (x,y  ) - gcd(0,  DM.  then  D - P Q and  D’  - P Q’  for 

1 v 

some  Q and  Q’  in  Z[x,y  1 such  that  gcdlQ,  QM  - 1. 

1 v 

Since  D*  is  primitive,  ti’  must  also  be  primitive  so  that 

either  Q’  - 1 or  deg(Q’)  > 0.  Assuming  the  latter,  we  have 

P Q dividing  P CT  (F/OM  or  Q dividing  Q’ (F/DM,  since  D 

divides  F.  But  gcdlQ,  O’)  ■ 1 so  that  Q divides  F/D  , hence 

Q divides  F /D * - F /D  . On  the  other  hand, 
b b b b 0 

P Q’  =D’=l'  = cD  • c D Q orQ  divides  D . Thus  Q 
bbbd  bbbb  0 b 

divides  gcd (0  . F /D  ) - 1,  by  Hi),  or  Q - 1 which  means 
0 b 0 t> 

Q’  = c contradicting  the  fact  that  deg(Q’  (x) ) - deg(Q’)  > 0 
b b 

by  assumption.  Therefore  we  must  f ^ve  Q’  » 1 or  0 ■ 0 Q. 

But  deg(D’)  - deg(O)  so  that  deg(Q)  0 and  we  have 

Dlx.y y ) - DMx.y y ) Q(y y ) where  Q is 

1 V 1 V 1 V 

independent  of  x.  // 
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IfaSPrem  III  - 2.3s  (Validity  of  the  EZGCD  Algorithm) 


Let  F (x,y  y ) and  G’ (x,y  ,...,y  ) be  multivariate 

1 v 1 v 


polynomials  in  Ztx.y  ,...,y  ] with  GCO  D’(x,y  y ).  Let 

1 v 1 v 

F - pp(F’),  f*  - cont(F’),  G - pp(G’),  g’  - cont(G’),  and 

0(><.y * * D'/gcd(f’,  g’).  For  any  chosen  eet  of 

1 v 

integral  values  b - lb  ,...,b  ) satisfying  the  conditions 

1 v 

deg (F  (x))  ■ deg(F)  and  deg!G  (x) ) *-  deg(G)  and  resulting  in 
b b 

a D (x)  « gcdlF  (x),  G (x))  satisfying  (i)  and  (ii)  of 
0 b b 

Theorem  III-2.2,  the  0"(x,y  ,...,y  ),  which  i3  obtained  from 


1 v 

0 (x)  via  the  application  of  the  multivariate  EZ  Algorithm 
0 


(I 1-5.4)  on  F,  0 , and  F /0  , must  satisfy  D - D"  or 
0 b 0 

(*.y y ) - 0”(x,y u ) gcd(f\  g’). 

1 v 1 v 

Proof:  Apply  the  Multivariate  EZ  Algorithm  on  F,  D , 


0 

and  F /D  to  get  D".  Since  the  evaluation  mapping  determined 
b 0 

by  b is  a homomorphism,  0 (x)  divides  D (x)  or  D - D C. 


b 0 0b 

But  (i)  implies  deglO  ) - degID  ) -o  that  C must  be  a con- 

0 b 

stant.  But  C divides  0 , in  fact,  C divides  I c (D  ),  so  C 


.j 


s 

i 


VKM» 
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must  be  a unit  modulo  the  particular  prime  p of  the  EZ  Algo- 
rithm. Thus  we  have  0-0  (mod  q , S)  and  so 

0 k 

F/0  - F /0  ' (mod  q , S).  Therefore,  by  Theorem  1 1-5.4  the 
b 0 k 

EZ  Algorithm  (1 1-5.4)  yields  0"  which  is  primitive  and 
divides  F.  By  the  result  of  Theorem  1 1 1 -2.2,  0 - D"  Q.  But 
F and  G being  primitive  implies  0 must  also  be  primitive,  so 
that  0,  being  independent  of  x,  must  be  1.  That  is  D-D" 
and  O’  - 0"  gcd(f\  g’).  II 

III  - 3 Univariate  GC0  Algorithm 

with  Zassenhaus’  Quadratic  Constructions 
Computing  the  GC0  of  univariate  polynomials  using  the 
Hense  I -type  construction  does  not  demonstrate  the  full  power 
of  the  Hense  I method.  Houever,  it  does  serve  the  purposes 
of  showing  that  (1)  in  many  cases  it  is  not  necessary  to 
compute  more  than  onu  homomorphic  image  of  the  GCD  in  order 
to  "in-ert"  the  mapping  and  get  the  actual  ansuer,  (2)  this 
method  also  avoids  the  coefficient  grouth  problem  of  the  PRS 
algorithms  and  at  the  same  time  retains  the  unique  advantage 
of  the  Modular  Algorithms  for  speedy  detection  when  the  GCD 
is  actually  1,  and  (3)  it  provides  a good  conceptual  basis 
for  understanding  the  multivariate  EZGCD  Algorithm  and  its 
related  problems,  since  ths  two  cases  are  basically  similar 


in  many  aspects. 
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Now  that  ue  have  established  some  theoretical 


foundations  for  the  utilization  of  the  Hensel-type 


construction  in  GCO  computations  in  the  last  section,  we 


will  directly  present  the  univariate  algorithm,  UNIGCD, 


relying  on  the  theoretical  similarities  of  the  multivariate 


and  the  univariate  cases. 


•ithm  III  - 3.1:  (Univariate  GCD  - UNIGCD) 


Input:  Univariate  polynomials  F’ (x)  and  G’  (x)  in 


ZIx]  where  deg(F’)  is  assumed  to  be  > deg(G’).  (Otherwise 


simply  switch  them  and  later  also  switch  the  order  of  the 


output  cofactors.) 


Output:  D’ (x)  «gcd(F’,  G’)  in  ZIx],  F’/D’ , andG’/D’. 


(Al)  Set  f*  < — cont(F’),  F < — pp(F’ ),  g’  < — cont(G’), 


G < — pp(G’),  and  d’  < — i gcd ( f * , g’)  uhere  the  " i ’ 


stands  for  integer  operations. 


(A2)  Set  fl  < — I c (F) , gl  < — lc(G),  and  dl  - i gcd ( f 1 , gl). 


Set  d <—  deg(F)  and  d <—  deg(G). 
F G 


Set  p to  a neu  prime  such  that  p does  not  divide  fl  gl. 


Se'  F < — F (mod  p),  G < — G (mod  p), 
P P 


D <—  mgcd(F  , G ) uhere  the  "m"  stands  for  modular 
P P P 


operations,  and  d < — deg(0  ) 

P 


(A3)  If  d ■ 0 then  D ■ 1 and  return  d’,  (f’/d*)  F,  and 

P 
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(g’/d’)  G.  Otherwise,  if  (A4)  has  been  passed  through 
already,  then  skip  to  (A5). 

(A4 ) Choose  a new  prime  p’  such  that  p*  does  not  divide 

fl  gl  and  compute  D ■ mgcd(F  , G ). 

P’  P’  p’ 

If  deg(D  ) < d,  then  set  F <—  F , G <--  G , 

P’  p p’  p p’ 

D <--  0 . p <--  p’,  d <—  deg(D  ),  and  repeat  (A4). 

P P'  p’ 

If  deg(0  ) =*  d,  then  go  to  (A3). 

P' 

If  deg(D  ) > d,  then  repeat  (A4). 

P' 

(A5)  If  d » d , then  test  uhether  G divides  F,  if  so  return 
G 

d’  G.  (f'/d’)  (F/G),  and  g’/d’. 

If  test-division  fails  then  go  to  (A6). 

If  d » d , then  continue  to  (A7). 

G 

(AG)  Choose  neu  primes  p"  until  a new  D » mgcd(F  , G ) 

p p p 

is  found  whose  degree  is  less  than  d.  Then  set 

F < — F , G < — G , D < — D , p < — p", 
p p p P P P 

d <—  deg(D  ),  and  go  back  to  (A3). 

P" 

(A7)  If  mgcd(D  , G /D  ) > 1,  then  set  U < — G,  D <--  D , 
p p o 0 p 


H <--  G /D  , and  go  to  (A8).  If  mgcd(D  , F /D  ) - 1, 


27 

then  set  U * F,  0 ■ 0 , H •F/O,  and  go  to  (A8) . 

0 p 0 p p 

Otherwise,  go  to  the  special  case  algorithm  (SI). 

(A8)  Apply  the  Univariate  EZ  Algorithm  (II-5.3)  to  U(x) 
in  Z(x],  and  0 (x),  H (x)  in  Z (xl . 

0 0 p 

If  the  outputs  are  0 and  H such  that  U - 0 H over  Z 

then  continue  to  (A3 ) . Otherwise,  the  outputs  are 

0 , H , and  q such  that  U ;0  H (mod  q ), 
k k k k k k 

then  p must  be  unlucky  so  that  a new  prime  p"  must  be 

found,  go  to  (A6). 

(A9)  If  U ■ G,  test  whether  0 divides  F, 

if  so  return  d’  0,  (f'/d*)  (F/0) , and  (g’/d’)  H. 

If  U = F,  test  uhether  0 divides  G, 

if  so  return  d’  0,  (f’/d’)  H,  and  (g'/d’)  (G/D). 

Otherwise  p must  be  unlucky,  so  go  to  (AG). 

(51)  Set  G"  <~  UN  I GCD  (G , dG/dx)  and  L <—  G/G* 

(via  Steps  (A2)  - (A3) ) . 

(52)  Set  L <—  UNI  GCD  (L,  F)  and  F"  <—  F/L 

(via  Steps  (A2)  - (A9) ) . Set  0"  <--  L and  0 < — L. 

(53)  Set  0 <--  mgcdd  , F",  G")  where  p is  the  valid, 

P p p p 

lucky  prime  already  used  in  UNIGCO  Steps  (A2)  - (A3) 
of  (S2) . 

(54)  If  deg(0  ) =»  0,  then  0 - 1,  so  return  d’  0", 

P P 
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(f’/d’l  (F/D"),  and  (g’/d’)  (G/D"). 

(S5)  I f L /D  *1,  then  skip  (SG)  and  go  to  (S7) . 

P P 

(SG)  Apply  Univariate  EZ  Algorithm  to  L in  Ztx]  and  D - D , 

0 P 

H - L /D  in  Z [x]  to1 get  D and  H in  Zlx]  such  that 

0 p p p 

L = D H over  Z.  Set  L < — D. 

(S7)  Set  D"  <~  0"  D,  F"  <~  F"/D  , G"  <-  G"/D  , 

P P P P P P 

and  go  to  (S3) . 

Non,  ue  make  some  brief  remarks  and  explanations  on  the 

individual  steps  of  the  algorithm  especially  those  relevant 

to  univariate  case,  leaving  detailed  discussions  of  points 

common  to  both  cases  to  the  multivariate  algorithm  later. 

Remark  (A4):  This  is  the  step  uhere  ue  attempt  to 

es tab  I i sh  Condi t i on  1II-B,  deg(D  ) - deg(D).  Since 

0 

D =>  gcd(F,  G)  is  the  unknoun  polynomial  to  be  determined, 
ue  have  no  uay  of  verifying  that  d - deg(D).  In  fact,  by 
Lemma  1 1 1-2.1,  d couid  be  > deg(D).  Houever,  the  probability 
of  the  chosen  prime  p being  unlucky  is  bounded  by  1/p  (see 
Theorem  3 of  1BR071 ] ) . If  ue  use  fairly  large  primes  and 
attempt  as  in  (A4)  to  repeat  d uith  another  prime  p then  we 
can  greatly  reduce  tue  probability  for  p to  be  unlucky  or 
greatly  inhance  the  probability  of  having  Condition  1 ! I -B  be 
true.  Thus,  in  Step  ( A4)  ue  try  uith  another  prime  p’.  If 


•JT-rT®  W'SSST ",™?,v’^i’!«'  *7 
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p was  unlucky  so  that  d > deg(O),  the  probability  of  p’ 

being  unlucky  an^  resultino  in  a D with  degree  equal  to  d 

P’ 

i9  at  least  bounded  by  1/ (p  p’).  That  means  that,  this  way, 
it  is  very  unlikely  for  us  to  use  a unlucky  prime  p for  our 
Hensel  construction  later. 

Remark  (A7):  Here,  we  determine  which,  If  one,  of  the 

given  polynomials  satisfy  Condition  III -A.  Since  it  is  a 
good  strategy  to  apply  Hense1  constructions  on  smaller 
polynomials,  we  test  the  condition  on  G first,  relying  on 
the  heuristic  basis  that  polynomials  of  smaller  degrees 


contain  fewer  terms.  If  Condition  II I -A  fails  on  both 
polynomials,  then  we  cannot  directly  apply  the  Hensel 
construction  to  this  problem.  However,  our  special  case 
algorithm,  which  applies  the  Hensel  construction  also,  will 
always  apply  for  any  tuo  polynomials  as  ue  Mill  soon  see. 

Note  that  it  is  possible  for  Condition  III -A  to  fail  even 
though  either  gcd(0,  F/0)  • 1 or  gcd(0,  G/D)  * 1,  if  the 
chosen  prime  is  unlucky.  Because  of  the  fact  that  ue  have 
decreased  the  probability  for  p to  be  unlucky  and  that  the 
special  case  algorithm  uork9  in  general,  ue,  therefore,  have 
a complete  GCD  algorithm  uhich  uses  the  Hensel  construction 
for  al I cases. 

Remark  (A8):  By  Theorem  1 1-5.3  and  uhen  q i9  suffi- 

k 

I 
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ciently  large  to  bound  all  possible  integral  coefficients, 

the  Univariate  EZ  Algorithm  returning  only  0 and  H such 

k k 

that  U =■  D H (mod  q ) could  only  mean  the  lack  of 
k k k 

any  divisor  of  U over  the  integers  corresponding  to  this 

0 (x)  or  having  the  same  degree  as  0 (x).  Thus  Lemma  1 1 1-2.  J. 
0 0 

would  imply  that  deg(D  ) > c!?g!D)  or  p is  unlucky. 

0 

Remark  (SI):  The  basic  underlying  principle  of  the 

special  case  method  is  that  if  one  of  the  two  original 
polynomials  is  square- free,  then  it  is  always  possible  to 
find  a prime  such  that  Condition  II1-A  holds.  Viewing  this 
another  way,  it  is  always  possible,  when  given  two 
polynomials,  to  compute  the  largest  square-free  part  of 
their  GCO  first  and  then  the  largest  square-free  part  of  the 
GCO  of  the  remaining  cofactors  successively.  Thus  we  will 
always  be  able  to  compute  essentially  the  square-free 
decomposition  of  the  GCO  of  any  two  polynomials,  hence  the 
GCO  itself. 

The  special  case  algorithm  first  computes  the  largest 
square-free  part  of  G,  then  uses  that  with  F to  get  the 
largest  square-free  part  of  the  GCO  of  F and  G.  Ue  leave 
many  detailed  discussions  on  the  individual  steps  of  this 


I 


special  case  algorithm  until  later  when  we  present  the 
multivariate  rase  and  the  Hensel-type  square-free 
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decomposition  algorithm  (Chapter  VI). 


Remark  (S2):  The  reason  for  only  needing  S ep9  (A2)  - 


(A9)  for  UNI GCD (G.  dG/dx)  and  for  UNIGCD(L,  F)  should  be 


nuite  clear.  However,  proofs  for  them  wi 1 1 be  presented  in 


Chapter  VII. 


As  compared  to  the  modular  GCO  algorithm  for  univariate 


polynomials,  the  essential  difference  for  the  UN1GCD  algo- 


rithm is  the  9tep  of  inverting  the  modular  homomorphism. 


Except  for  the  higher  overhead  of  the  Zassenhaus’  Quadratic 


Extension  Algorithm,  i.e.  the  cost  of  applying  the  Extended 


Euclidean  Algorithm  (1 1-2.1)  on  the  two  modular  codivisors. 


the  remaining  operations  are  mainly  integer  arithmetic  only. 


The  Chinese  Remainder  Algorithm  also  only  involves 


arithmetic  operations  on  each  pair  of  corresponding  coef- 


ficients, except  each  of  these  require  one  application 


of  the  integral  extended  Euclidean  algor i thm.  In  general, 


if  p is  an  average-sized  prime  that  we  U9e  in  either 


algorithm  (we  assume  that  the  primes  used  all  have  appro- 


ximately the  same  integer  length),  and  B is  the  bound 


jn  all  integral  coefficients  for  a given  problem,  then  the 


major  computational  gain  of  using  the  Zassenhaus’  algorithm 


is  due  to  the  fact  that  it  requires  only  k 9tep9  where 


k - log  (log  (B) ) , uhereas  the  modular  algorithm  will  need 
2 p 


to  apply  the  Chinese  Remainder  Algorithm  on  corresponding 


I 


I 


coefficients  for  k'  step9  where  k*  - log  (B).  Thus  the 

P 


method  using  the  Zassenhaus'  algorithm  can  shou  some  compu- 
tational efficiencies  when  B is  quite  iarge  or  p is  small  so 


that  k’  * log  (B),  which  is  the  number  of  steps  the  nodular 
P 


GCD  Algorithm  has  to  go  through,  is  large  compared  to  k 


( = log  Ik’)).  However,  it  is  very  costly  for  the 

2 


Zassenhaus’  Quadratic  Extension  Algorithm  to  have  to  main- 
tain relative  primeness  of  the  two  codivisor3  being  con- 


structed (i.e.  getting  A and  B such  that 

j j 


A G + B H * 1 (mod  q )s  see  Lemma  1 1-3.2).  In  this 
j j j j j 


respect,  it  seems  that  the  ordinary  Hense I method  (Algo- 
rithm 1 1 -3. 1)  will  prove  to  be  more  efficient  when  the 
number  of  steps,  k’ , for  this  algorithm  is  relatively  small 
(e.g.  k’  < 16  or  k < 4) , Also  the  computations  of  succes- 


sive residual  terms  in  both  algorithms  (the  C ’ s) , require 

j 


cross  multiplications  of  the  current  factors  which  can  be 
very  costly,  especially  when  the  coefficients  are  large. 
These  complex  trade-offs  in  computational  efficiencies  of 
the  three  modular  methods  for  computing  univariate  GCD's  are 
currently  under  active  study  and  further  results  will  be 
reported  in  a future  paper  by  Yun  and  fliola. 


Ill  - 4 The  flu  1 1 i variate  GCD  Algorithm  - EZGCD 

With  Section  1 1 1 -2  as  theoretical  basis  and  the  last 
section  as  brief  introduction.  Me  now  present  the  detailed 
EZGCD  Algorithm  with  remarks  on  some  of  'the  steps  to  follow 
afterwards. 

Algorithm  III-4.1:  (Multivariate  GCC  - EZGCD) 

Input:  Multivariate  polynomials  F'(x,y  ) and 

1 v 

G'  <x,y  , . . . ,y  ) in  Z(x,y  , . . . ,y  ] where  deg  IF’ ) in  x is 
1 v 1 v 

assumed  to  be  > deg(G’)  (Otherwise,  simply  switch  the  input 
polynomials  and  later  also  switch  the  order  of  the  output 
cofactors. ) 

Output:  D'(x,y  ) « gcd(F',  G’ ) in 

1 v 

Ztx.y y]  F’/D\  and  G’/0\ 

1 v 

(Al)  Set  f’  <—  cont(F’),  F <--  pp(F’),  g’  <—  cont(G’), 

G <~  pp(G'),  and  d’  <—  gcd(f’,  g’). 

(A2)  Set  d < — deg (F (x> ) and  d < — deg (G (x) ) . 

F G 

If  b = lb  *0,  b -0,  ....  b -01  is  a valid  evaluation 
1 2 v 

(i  e.  ( I c (F) ) and  ( I c (G) ) 4 0),  then  compute  F (x), 
b b b 

G ( x V , and  D (x)  » gcd(F  , G ).  Otherwise,  find  a new 
b b b b 

valid  evaluation  b which  contains  as  many  zeros  as 


AM*,  li 


possible  and  compute  D (x)  ■ gcd(F  (x),  G (x)).  Set 

b b b 

d < — deg(Q  (x) ) . 
b 

(A3)  If  d =»  0,  then  return  d',  (f’/d’)  F,  and  (g’/d’)  G. 
Otherwise,  if  (A4)  has  been  passed  through  already, 
then  skip  to  (A5). 

(A4)  Choose  a new  valid  evaluation  b’  and  compute 

D (x)  = gcd (F  (x),  G (x)). 
b’  b*  b’ 

If  deg(D  ) < d,  then  set  F < — F , G < — G , 
b’  b b*  b b’ 

D <—  D , P <—  P’,  d <—  deg(D  ),  and  repeat  (A4). 
b b’  b’ 

If  deg(D  ) » d,  then  go  to  (A3), 

P* 

If  deg(D  ) > d,  then  repeat  (A4), 

P* 

(A5)  If  d - d , then  test  uhether  G divides  F,  if  so  return 
G 

d*  G.  (f’/d* ) (F/G),  and  y7d\ 

If  test-division  fails,  then  go  to  (AG). 

If  d / d , then  continue  to  (A7), 

G 

(AG)  Choose  a new  valid  evaluation  b"  until  the  correspond- 
ing 0 (x)  - gcd(F  (x),  G (x))  has  degree  'ess  than 

b"  b"  b" 

d.  Then  set  F <—  F , G <--  G , D < — D , 
b b"  b b"  b b" 


b < — b",  d < — deg(D  ),  and  go  to  (A3). 
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(A7)  If  gcd(D  , G /D  ) - 1,  then  set  U < — G,  D < — D , 
b b b lb 

H <—  G /D  , and  go  to  (A8). 

1 b b 

If  gcd(D  , F /D  ) • 1,  then  set  U < — F,  D < — D , 
b b b lb 

H < — F /D  , and  go  to  (A8). 

1 b b 

Otherwise,  go  to  the  special  case  algorithm  (SI). 

IA8)  Apply  the  Multivariate  EZ  Algorithm  1 1-5.4  to  U in 

(Z  [y  y 3 ) lx)  and  D (x),  H lx)  in  (Z/q  ) [x]  where 

1 v 11  k 

q = p**(2**k)  for  some  lucky  prime  p for  U . 
k b 

If  the  outputs  are  D and  H such  that  U - D H over  Z, 

then  continue  to  (A9) . Otherwise,  the  outputs  are  D , 

n 

n 

H . q , and  n such  that  U jD  H (mod  q , S ),  then 
n k n n k 

b must  be  an  unlucky  evaluation  so  that  a new  evalua- 
tion must  be  found,  go  to  (AG). 

(A9)  If  U - G,  test  whether  D divides  F,  if  so  return  d’  D, 

( f * /d* ) (F/D),  and  (g'/d')  H. 

If  U = F,  test  whether  D divides  G,  i f so  return  d’  D, 
(f’/d’)  H,  and  (g’/d’)  (G/D). 

Otherwise,  b must  be  an  unlucky  evaluation,  so  go  to  (AG). 
(SI)  Set  G"  < — EZGCDfG,  dG/dx)  and  L < — G/G"  (via  Steps 


Eib 
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.7 


(A2)  - (A9)  above). 

(52)  Set  L <—  EZGCD(L,  F)  and  F”  <—  F/L  (via  Steps  (A2)  - 
(A9)  above).  Set  0"  < — L and  D < — L. 

(53)  Set  D (x)  <—  gcd(L  (x),  F"(x),  G"(x))  where  b is  the 

b b b b 

valid,  lucky  evaluation  already  used  in  EZGCD  Steps 
(A2)  - (A9)  of  (S2) . 

(54)  If  deg (D  (x) ) - 0,  then  D (x)  - 1,  so  return  d’  D", 

b b 

(f’/d)  (F/D " ) , and  (g’/d’)  (G/D"). 

(SB)  I f L /D  ■ 1,  then  skip  (S6)  and  go  to  (S7). 
b b 

(56)  Apply  Multivariate  EZ  Algorithm  to  L in  Ztx.y  ], 

1 v 

and  D * 0 , H ■ L /0  in  (Z/q  ) tx),  where 
1 b 1 b b k 

q - p**(2**k)  for  some  lucky  prime  p for  L , to  get  D 
k b 

and  H in  Z(x,y  ,...,y  ] such  that  L - D H 'over  Z.  Set 
1 v 

L <~  D. 

(57)  Set  D"  <—  D"  0.  F"  <—  F"/D  , G"  <—  GVD  , and  go 

P P P p p p 

to  (S3). 

Remark  (Al):  Contents  and  primitive  parts  of  any  mul- 

tivariate polynomials  are  computed  recursively  on  the  varia- 
bles. Here,  the  contents  are  taken  w.r.t.  the  main  variable 

x and  are  polynomials  in  Z ty  , ...,y  ],  That  is  we  consider 

1 v 


I 

I 

l ! 
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the  given  polynomial  to  be  in  (Z(y y 1 ) [x] . The  compu- 

1 v 

tation  involves  GCD’s  of  all  coefficients  with  respect  to  a 
particular  main  variable  (here  x).  This  ie  accomplished  by 
using  a variant  of  this  algorithm  which  computes  the  GCD  of 
all  coefficients  in  a semiparallel  fashion  uhich  will  be 
discussed  in  more  detail  in  Chapter  Vi. 

Remark  (A2):  In  Section  1 1 —4  and  II-6  we  have  briefly 

mentioned  the  potential  seriousness  of  the  "non-zero  sub- 
stitution" problem.  Section  II-6  points  out  that  the  tech- 
nique we  use  for  putting  polynomials  into  the  generalized 
Taylor  series  forms  may  cause  blowups  in  the  number  of  terms 
resulting  in  computational  inefficiencies  for  certain  bad 
cases.  Ue  see  from  the  uay  this  transformation  is  carried 
out  (Sec.  1 1 -4),  it  is  desirable  to  have  as  many  zero9  in 
the  set,  b.  of  the  integral  values  for  evaluation  a9  possi- 
ble so  that  the  number  of  terms  in  the  expanded  form  of  the 
series  is  not  increased  unnecessarily.  Due  to  the  require- 
ment for  valid  and  lucky  evaluations,  it  is  not  always 
possible  to  choose  zeros  for  evaluation.  But  more  zeros  in 
the  set  of  values  used  for  the  Generalized  Hensel  construc- 
tion certainly  improves  its  efficiency  because  of  the 
dependency  of  the  computing  cost  for  the  GHA  on  the  number 
of  terms  in  the  Taylor  series  form  of  the  given  polynomial 
as  we  shall  see  in  the  next  chapter.  Since  it  is  not  po9si- 


\ 


-.3 


ble  to  check  whether  an  evaluation  is  lucky  or  not  at  this 

point,  we  only  make  sure  the  set  of  values  for  evaluation 

contain  as  many  zeros  as  possible  and  still  remain  valid. 

Remark  (A3) : Since  deg (gcd (F , G) ) < deg(D  (k) ) , 

b 

d =»  0 implies  gcd(F,  G)  is  independent  of  x.  But  F and  G 

are  primitive  so  that  gcd(F,  G)  must  simply  be  1.  Thus, 

gcd  IF',  G’)  is  just  d’  and  the  cofaotors  are 

F’/d’  ■ (f'/d’)  F and  G’/d'  - (g’/d')  G respectively. 

Remark  (A4):  In  this  step,  we  attempt  to  establish 

Condition  III-B,  deg(0  ) - deg(O).  Since  D - gcdlF,  G)  is 

0 

the  unknown  polynomial  to  be  determined,  ue  have  no  uay  of 
verifying  that  d ■ deg(D)  at  this  point.  In  fact.  Lemma 
1 1 1-2. 1 indicates  that  d could  be  > deg(D).  However,  the 
total  number  of  possible  unlucky  integral  values  for  evalua 
tion  is  bounded  by  (v  m deg (F) ) where 
v 

m *=  max  (deg  (F)  + deg  (G) ) (see  Theorem  5 of  [BR071D.  S 

i “1  y y 

i i 

even  if  ue  limit  our  possible  choice  of  integral  values  to 

only  those  in  Z/p  for  some  prime  p,  the  probability  for  any 

evaluation  b = lb  ,...,b  ! to  be  unlucky  is  bounded  by 
1 v 

2 

v m deg(F)/p.  If  the  given  GCO  problem  is  of  reasonable 

2 

size  so  that  v m deg(F)  is  not  too  large  and  the  prime  is 


fairly  large,  then  the  task  of  (A4) , which  tries  to  repeat 


the  same  degree  d with  another  evaluation  b’,  will  greatly 


reduce  the  probability  for  b to  be  unlucky  or  inhence  the 


probability  of  having  Condition  II-B  be  true.  Because  of 


the  fact  that  deg(D)  < deg(D  ■ gcd(F  , G ))  for  any  given 

b b b 


valid  evaluation  b,  note  that  if  we  make  several  evaluations 


of  F and  G at  different  sets  of  values,  then  the  one  result- 


ing in  the  minimal  degree  for  D should  be  used.  In  fact, 

b 


if  the  number  of  trial  evaluations  are  larger  than 


v m deg (F)  then  we  are  sure  to  find  one  evaluation  among 


them  which  has  degree  equal  to  deg(D),  In  Step  (A4)  we  try 


to  find  a valid  evaluation  b which  yields  a D (x)  having  the 

b 


degree  that  is  repeated  by  some  other  valid  evaluation. 


Thus  if  the  first  evaluation  is  unlucky  and  results  in  a 


high  degree,  the  chances  of  repeating  this  urong  degree  are 


much  smaller.  This  step  virtually  insures  the  success  of 


getting  a lucky  evaluation.  Together  with  Step  (A2) , we  can 


be  quite  certain  that  the  evaluation  b we  use  in  this 


algorithm,  hence  for  the  Hensel  construction,  i9  valid, 


lucky,  and  contains  many  zeros. 


Remark  (A5) : Since  deg(G(x))  < deg(Flx))  and  F,  G are 


primitive,  when  d - d after  Step  <A4)  it  is  very  likely 

G 


4 - 
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that  G divides  F.  If  this  test-division  fails  ue  still  get 
the  valuable  information  that  d should  be  smaller  than  it  is 
now.  Therefore,  this  is  a strategic,  time  saving  step  in 
any  case. 

Remark  (AG):  This  step  is  used  at  several  other  places 

of  the  algorithm,  whenever  we  can  get  the  information  about 
d being  too  large  or  the  corresponding  evaluation  b is 
unlucky.  In  these  cases  we  know  there  is  an  evaluation,  say 
b",  which  can  result  in  an  univariate  GCD  of  louei  degree 
than  d.  So  we  keep  on  choosing  new  valid  evaluations  until 
such  a b"  is  fourd. 

Remark  (A7) : Here  we  determine  which,  if  one  of  the 

given  polynomials  satisfy  Condition  III-A  so  that  the  Hensel 
construction  can  be  applied.  Since  it  is  a good  strategy  to 
apply  Hensel  constructions  on  polynomials  with  fewer  terms, 
we  test  the  condition  on  G first,  relying  on  the  heuristic 
basis  that  polynomials  of  smaller  degrees  contain  fewer 
terms.  If  Condition  III-A  fails  on  both  polynomials,  then 
we  cannot  directly  apply  the  Hensel  construction  to  this 
problem.  However,  our  special  case  algorithm,  which  also 
applies  the  Hensel  construction  will  always  apply  for  any 
two  polynomials.  Note  that  if  the  chosen  evaluation  is 
unlucky  it  is  possible  for  Condition  III-A  to  fail  even 
though  either  gcd(D,  F/0)  - 1 or  gcd(D,  G/D)  - 1.  But  Step 
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(A4)  has  reduced  the  probability  for  using  an  unlucky 
evaluation  and  the  special  case  algorithm  can  handle  any 
polynomial  inputs,  so  we  have  a complete  GCD  algorithm  which 
applies  the  Hensel  construction  for  all  cases. 

Remark  (A8) : Note  that  the  application  of  the  Multi- 

variate EZ  Algorithm  (I  I -5.4)  requires  a lucky  prime  p for 

U - because  it  is  necessary  to  compute  A(x)  and  B(x)  such 
b 

that  AD  + BH  - 1 in  (Z/q  ) lx]  by  using  the  Extended 
11  k 

Euclidean  Algorithm  (1 1-2.1).  This  lucky  prime  can  be  found 
in  several  ways.  One  such  method  is  simply  using  the  lucky 
prime  from  the  GCD  computation  in  Step  (A7),  assuming  of 
course  the  univariate  GCD  computations  are  performed  by 
cither  the  Modular  GCD  Algorithm  or  the  UNI GCD  Algorithm 
which  requires  a lucky  prime  for  its  computations.  In  fact, 
such  a lucky  prime  can  also  be  provided  by  any  other  step  of 
this  algorithm  (e.g.  (A2) ) where  univariate  GCD’s  are  com- 
puted, so  long  as  the  univariate  GCD  algorithm  used  is  the 
modular  kind.  By  Theorem  1 1-5.4  and  since  p is  lucky,  q 


and  n are  sufficiently  large,  the  case  where  the  Multiva 

riate  EZ  Algorithm  returns  only  D anc  H such  that 

n n 


U ■ 0 H (mod  q , S ) can  only  mean  the  non-existence  of 
n n k 


■■■•.'•-A. 
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any  divisor  of  U corresponding  to  this  D (x).  Lemma  111-2.1 

1 

would  then  imply  that  deg(0  (x))  > deg(D)  or  that  b is  an 

1 

unlucky  evaluation. 

Remark  (A9):  If  U - G,  then  G - D H over  Z.  Since 

deg(gcd(F,  G) ) < deg(gcd(F  , G ))  and  primitive  polynomials 

b b 

over  Z are  uniaue  up  to  signs,  D must  be  gcdlF,  G)  if  D also 
divides  F.  If  D does  not  divide  F,  then  D must  contain  some 
factor  of  G which  is  not  a factor  of  F.  That  is  the  degree 
of  D must  be  too  big  or  this  evaluation  b is  unlucky,  so  ue 
must  re-evaluate  by  going  back  to  Step  (AS).  This  is  a 
"safety-valve"  step  which  catches  all  hitherto  undetected 
unlucky  evaluations  even  though  the  probability  of  such  has 
been  greatly  reducd  by  Step  (A4). 

Remark  (SI):  Similar  to  Remark  (SI)  of  the  UNIGCD 

Algorithm,  we  point  out  again  that  the  basic  underlying 
principle  of  the  special  case  method  (SI)  - (S7)  is  that  if 
one  of  the  two  original  polynomials  is  9quare-free,  then  it 
is  always  possible  to  find  an  evaluation  such  that  Condition 
! 1 1 -A  holds.  That  is,  for  two  arbitrary  multivariate 
polynomials  it  is  always  possible  to  compute  (using  the 
Hensel  construction)  the  largest  square-free  part  of  their 
GCO  first,  and  then  the  largest  square-free  part  of  the  GCD 
of  the  remaining  cofactors,  successively.  Thus,  we  will 
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always  be  able  to  compute  essentially  the  square-free 

decomposition  of  the  GCO  of  any  two  polynomials  using  Hensel 

construction,  hence  the  GCO  itself. 

The  special  case  algorithm  first  computes  the  largest 

square-free  part  of  G,  then  uses  that  with  F to  get  the 

largest  square-free  part  of  the  GCD  of  F and  G.  This  step 

computes  the  largest  square-free  part  of  G,  by  computing  the 

cofactor  of  G with  respect  to  gcd(G,  dG/dx).  This  GCD  can 

always  be  computed  by  going  through  Steps  (A2)  - (AS)  only. 

The  reason  for  this  is  made  clear  by  the  following  lemma: 

Lemma  (SI):  Let  P be  primitive  in  (Zty  DM 

1 v 

and  R - gcd(P,  dP/dx) , then  gcd(R,  (dP/dx)/R)  - 1. 

Note  that  this  virtually  insures  that  of  Condition  III  -A 
holds,  so  that  there  will  be  no  need  for  special  case 

consideration  for  gcd(G,  dG/dx). 

Here  ue  point  out  that  this  special  case  algorithm  (SI) 

- (S7)  with  the  above  lemma  and  some  other  conditions 
constitutes  the  essential  ideas  for  an  algorithm  of  finding 
the  square-free  decomposition  of  a multivariate  polynomial 
using  the  Hensel  construction.  The  clarification  of  this 
statement  together  with  the  proof  of  the  above  lemma  will  be 
deferred  to  Chapter  VI  on  square-free  decompositions  of 


polynomials. 

Remark  (S2):  Since  L is  square-free,  GCD  of  L and  any 


other  polynomial  can  be  computed  by  Step  (A2)  - (A9)  of  th i s 
EZGCD  Algorithm.  That  is,  when  L is  square-free,  it  is 
always  possible  to  find  a valid  lucky  evaluation  for  L such 
that  L evaluated  is  still  square-free  (see  Chapter  VI  or 
[USR731  for  a proof).  0"  uill  be  used  to  denote  the 
accumulative  GCO  up  to  this  point,  D the  current  largest 
square-free  part  GCO,  and  F",  G"  the  cofactors  of  D"  in  F,  G 
respectively. 

Remark  (S3):  If  b is  a valid,  lucky  evaluation  for  L 
and  F in  (S2)  then  it  must  be  lucky  for  both  F and  G or  for 
any  square-free  part  of  their  GCO.  Therefore,  from  this 
point  on,  ue  uill  aluays  use  this  valid,  lucky  evaluation 
for  our  succeeding  computations.  Uhat  ue  intend  to  compute 
nou  is  gcd(L,  F",  G").  But  since  b is  a lucky  and  valid 
evaluation  for  this  problem,  all  that  needs  to  be  computed 
dt  this  point  is  the  univariate  GCD  of  the  three  polynomials 
evaluated.  Ue  can  then  use  this  univariate  GCO  for  the 
Hensel  construction  uith  L to  get  the  actual  mutivariate 
gcd(L,  F",  G").  Here  the  GCD  is  a univariate  GCD  taken  over 
the  integers.  GCO  of  several  polynomials  can  be  taken 
recursively  by  successive  pairs.  Houever,  as  ue  shall  see 
in  the  next  chapter,  it  is  also  possible  to  semi -para  1 1 e I I y 
compute  the  GCO  of  several  polynomials  and  their 
corresponding  cofactors  together  for  either  the  multivariate 
or  the  univariate  cases  using  Hensel  constructions. 
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Remark  (55):  I f L /D  - 1,  then  we  know  that  the 

b b 

largest  9quare-free  part  of  the  remaining  cofactors  of  D"  in 

F and  G (i.e.  F"  and  G")  is  the  same  as  the  current  largest 

square-free  part,  L,  or  0 ■ L.  It  is  then  not  necessary  to 

apply  the  Multivariate  EZ  Algorithm  and  simply  use  the 

current  largest  square-free  part  of  the  GCD,  D,  to  update 

the  accumulative  GCD,  D",  in  (57).  Thus  whenever  L /D  ■ 1 

b t. 

we  can  save  one  application  of  the  EZ  Algorithm.  For 
3 2 2 3 

example,  if  F - U V and  G - U V , then  L » D ■ U V in 

2 2 

(S2),  F"  - U V and  G"  - U V , so  that  gcd(l,  F",  G")  - U V 

again  and  in  (S3)  D » U V - L resulting  in  L /D  - 1. 

b b h b b b 

Remark  (SG):  Here  we  apply  the  Multivariate  EZ  Algori- 

thm on  L,  D , and  H to  get  D • gcd(L,  F",  G"),  as  we  set 

1 1 

out  to  do  in  Step  (S3).  The  lucky  prime  p for  L can  be  the 

b 

one  used  in  (S3)  for  the  computation  of  univariate  GCD,  D . 

b 

Uith  these  guaranteed  lucky  prime  p,  a va' ■ lucky  evalua- 
tion b,  and  a square-free  L,  all  conditions  of  Theorem 
1 1 -5.4  (Multivariate  EZ)  are  satisfied  so  that  the  EZ  Algo- 
rithm produces  the  correct  multivariate  polynomials  D and  H 
such  that  L • D H over  Z.  This  0 must  be  the  GCD  of  L,  F", 
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and  G",  or  the  largest  square-free  GCD  of  F"  and  G".  This 

largest  square-free  GCO  of  F"  and  G",  D,  must  be  smaller 

than  the  previous  one,  L,  and  ue  can  take  advantage  of  it  by 

using  this  0 (setting  L <—  D)  for  computation  of  later 

largest  square-free  GCO's  for  the  neu  cofactors  F"  and  G" 

corresponding  to  F"  and  G"  of  the  next  step, 
b b 

Remark  (S7):  Here  ue  update  the  accumulative  GCD  to 

include  the  D just  found.  Ue  then  also  update  the  cofactors 
to  correspond  to  this  0",  in  order  to  begin  the  computation 
of  the  ne^t  largest  square-free  GCD.  But  ue  nave  already 
obse'  ved  that  in  (S3)  ue  only  need  the  univariate  (or  eva- 
luated) cofactors,  F"  and  G",  so  we  only  need  to  update 
b b 

these  univariate  nolynomials  instead  of  the  corresponding 
multivariate  polynomials.  Therefore,  we  see  that  after 
Steps  (SI)  and  (S2),  the  only  multivariate  polynomials  used 
in  the  remaining  computations  are  the  largest  square-free 
GCO’s.  L,  in  the  application  of  the  EZ  algorithm.  The  final 
answer  gcd(F,  G)  i9  simply  0"  uhich  is  the  product  of  the 
results  of  the  EZ  Algorithm  in  (SB).  This  fact  will  account 
for  the  comparable  efficiency  of  this  special  case  algorithm 
with  the  regular  algorithm  ( ( A1 ) - (A3) ) and  the  Modular  GCD 
A I gor i thm. 


..  ..  
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III  - 4(a)  Solution  to  the  Leading  Coefficient  Problem 

Because  of  the  non-uniqueness  of  polynomials  in  image 
domains,  all  algorithms  based  on  modular  techniques  suffer 
by  having  to  perform  some  extra  process  in  order  to 
determine  a unique  answer  from  its  class  of  associates. 

This  is  often  accomplished  by  recursively  requiring  a unique 
leading  coefficient.  Since  the  GCO  algorithm  uas  one  of  the 
earliest  considered  i’rom  this  point  of  vieu,  the  leading 
coefficient  problem  naturally  came  up.  In  his  nou  classic 
paper  on  the  Modular  GCO  Algorithm  CBR071 ] , Broun  clearly 
recognized  this  problem.  His  impl ici tly  proposed  solution 
(only  given  in  the  algorithmic  specifications  for  Algorithm 
M and  P)  uses  the  GCO  of  the  leading  coefficients  of  the  two 
given  polynomials.  This  polynomial  '.s  clearly  divisible  by 
the  leading  coefficient  of  the  true  GCO  of  the  tuo  given 
polynomials.  The  Modular  GCO  Algorithm  then  artificially 
imposes  this  onto  a monicized  GCO  so  as  to  have  a unique 
polynomial  uhich  is  a simple  multiple  of  the  actual  GCO.  In 
fact,  the  GCO  is  the  primitive  part  of  this  polynomial. 

This  approach  still  results  in  the  blouup  problem, 
uhich  ue  discussed  in  Section  II -5 (a).  Although  our 
solution  to  this  leading  coefficient  blcuup  (LCB)  problem  is 
in  essence  similar  to  Broun’s  approach,  some  important 
differences  as  uell  as  the  significance  of  this  blouup 
problems  alone  make  this  separate  treatment  uorthuhi le.  Our 
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approach  achieves  the  same  goal  in  the  end  and  also  uses  the 
GCO  of  the  leading  coefficients  as  does  the  nodular 
Algorithm,  but  the  means  for  avoidim  the  blouup  are 
different  in  many  ways.  Most  improtantly,  our  method 
preserved  the  sparseness  of  the  problem  during  computation 
at  a small  cost  (of  computing  the  GCD  of  the  leading 
coefficients  and  multiplying  it  to  the  polynomials). 

In  Section  II -5 (a)  ue  discussed  the  causes  and  the 
potential  seriousness  of  the  LCB  problem.  Ue  also  presented 
a method  (Theorem  and  Algorithm  11-5.5)  of  avoiding  this 
problem  and  an  example  demonstrating  its  feasibility.  But 
it  is  not  always  possible  to  know  in  advance  the  leading 
coefficients  of  the  co-divisors  that  the  Hensel  construction 
is  attempting  to  find.  So  the  method  proposed  by  Theorem 
1 1 -5.5  must  have  appeared  extremely  artif  icial  and 
impractical.  In  the  case  of  the  EZGCD  Algorithm,  however, 
there  is  actually  much  to  be  gained. 

Let  F and  G be  the  primitive  parts  of  the  input  poly- 
nomials for  the  EZGCD  Algorithm  and  D be  gcd(F,  G)  to  be 

computed.  Consider  F,  G,  D to  be  in  (Z(y  ,...,y  ) ) (x)  and 

1 v 

let  fl  = Ic (F) , gl  - I c (G)  and  dl  - lc(D).  These  are  poly- 
nomials in  Z [y  ,...,y  1 independent  of  x.  Ue  can  compute 
1 v 

dl"  - gcd ( f I , g')  for  later  use.  Note  that  this  is  a GCD 
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computation  of  one  less  variable,  uhich  can  hopefully  be 

done  more  easily.  In  fact,  this  computation  is,  in  most 

cases,  less  costly  than  the  calculation  of  contents  and 

primitive  parts  of  the  input  polynomials  F*  and  G’  to  the 

EZGCO  Algorithm.  Before  Step  IV  of  the  EZGCD  outline  of 

Section  1 1 1-2  (or  before  the  application  of  the  Tlul  tivar  iate 

EZ  Algorithm  in  Step  (A8)  of  the  EZGCD  Algorithm,  assuming 

F satisfies  the  test  in  (A7H,  ue  have  F,  D (x), 

0 

H (x)  - F (x)/D  (x)  such  that  F ; D H (mod  q , S)  where 
0 b 0 0 0 k 

D (x)  ■ gcd(F  (x),  G (x)).  At  this  point  ue  let 

0 b b 

D (x)  » dl"  (D  / I c (D  ))  and  H (x)  ■ lc(D  ) H (mod  q ),  then 

1 b 0 0 1 0 0 k 

the  following  are  all  trues 

(i)  I c (D  (x) ) = dl"  and  dl"  is  a known  polynomial, 

lb 

(ii)  lc(H  ( k ) ) - I c (D  ) I c (H  ) ■ I c (F  ) ■ fl  and  fl  Is  a 

1 0 0 b b 

given,  hence  unown  polynomial. 

(Hi)  dl"  F - dl"  F - 0 H (mod  q , S). 

b b 1 1 k 

Thus  by  theorem  II-5.5,  Algorithm  II-5.5  can  be  used  instead 

of  the  Hu  I tivar iate  EZ  Algorithm  in  the  EZGCD  Algorithm  in 

order  to  alleviate  the  LCB  problem,  If  the  prime  and  the 

evaluation  values  used  are  lucky,  then 

0 (x)  » D(x,b  b ) and  the  outputs  obtained  will  be 

0 1 v 


— ....  I 


£ 

| 


O’  « dl"  (D/d I ) and  H’  * dl  H such  that  dl“  F - O’  H’  over  Z. 
Since  dl  divides  dl",  cont(D’)  ■ dl"/d'  or  dl  » d I’Vcont (D* ) , 
90  that  0 - D'/cont(0’)  - pp(D’)  and  H ■ H'  coni (D* ) /dl". 

In  this  case,  there  is  no  blowup  problem  due  to  multiplica- 

n 

tion  by  9ome  unit  u in  S or  1/u  as  pointed  out  in  Section 
II-5(a).  Therefore,  the  leading  coefficient  problem  for  the 
Hensel  construction  in  EZGCO  Algorithm  is  avoided  at  the 
relatively  small  cost  of  computing  dl"  ■ gcdtfl,  gl)  and 
using  dl"  F for  the  EZ  Algorithm  instead  of  F.  Ue  modify 
the  EZGCD  Algorithm  by  changing  Step  (A8)  as  folk/ws: 

(A8’l  Compute  dl"  - gcd ( I c (F) , I c (G) ) . 

Set  U"  <—  dl"  U,  D <--  dl"  (D  / lc(D  ’/),  and 

1 b 1 'i 

H <—  I c (0  ) H . 

1 1 1 

If  U - F,  then  apply  Algorithm  1 1 -5.5  on  U",  D , H , 

1 1 

dl,  and  I c (F) , otherwise  U G,  then  apply  Algorithm 

1 1-5.5  on  U",  0 , H , dl,  and  lc(G),  to  replace  the  use 
1 1 

of  the  Multivariate  EZ  Algorithm  in  (A8).  If  the 
results  are  D"  and  H"  such  that  U"  * D"  H"  over  Z, 
then  clearly  U - pp(U")  - pp(D")  pp(H")  ■ D H with 
0 - pp(0")  and  H » pp(H"),  then  continue  to  A (3). 
Otherwise,  as  in  (A8).  b must  be  an  unlucky  evalua- 
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t i on,  so  go  to  (AS). 

Example  II 1-4.1:  In  the  rest  of  thi s section  we  will 

compute  the  multivariate  SCO  of  two  given  non-monic  poly- 
nomials using  the  EZGCD  Algorithm.  Hopefully  this  example 
will  demonstrate  many  essential  features  of  this  algorithm, 
especially  the  way  the  leading  coefficients  are  handled  (as 
in  (A8’l).  To  avoid  using  too  lengthy,  we  will  go  through 
the  steps  of  the  algorithm  outline  in  Section  III-2  instead 
of  following  the  detailed  steps  of  the  EZGCD  Algorithm. 
However,  the  modified  version  of  the  multivariate  EZ  Algo- 
rithm (I  I -5.5.  as  in  (A8’))  rather  than  the  regular  version 
(Algorithm  1 1-5.4,  as  in  (A8H  will  be  used.  Also,  the 
spocial  case  method  is  not  needed  for  this  example. 

Assume  the  problem  is  to  compute  the  GCD,  O’,  of  two 
given  multivariate  polynomials  F’  and  G’  where 

2 2 2 2 2 3 

F’  - y x +(-yz  - y z+yz+y  +2y)x  + yz 

3 2 

- z -y  z-yz+2z  and 

3 3 2 3 2 

G’  - y x + l-y  z - y z + y z)  x 

3 3 2 2 

+ (yz  - z -y  + 2 y)  x + y z-3yz  + 2z. 

It  is  easily  realized  that  F*  and  G’  are  primitive,  so  that 

d'  » f*  = g*  = 1 , F » F' , G ■ G' , and  D'  « D ■ gcd(F,  G). 

By  choosing  b - II,  01  which  is  clearly  a valid  evaluation 


"tittfiit-  - - 


2 3 

for  F and  G,  we  have  F <x)  - x + 3 x,  G (x)  - x + x,  and 

b b 

D (x)  = gcdlF  . G ) * x.  Since  deg(D  (x))  • 1 / deg(F)  or 
0 b b 0 

deg(G),  we  test  Condition  III -A  and  find 

gcd(0  , F /D  ) = gcd(x,  x+3)  ■ I.  So  we  continue  to  Step  IV 
0 b 0 

assuming  Condition  III-B,  deg(0  ) - 1 - deg(D).  Nou  the 

0 

Hensel  construction  needs  to  be  applied  on  F,  D , and  F /D  , 

0 b 0 

but  the  multivariate  leading  coefficients  of  D and  F /0 

0 b 0 

are  not  knoun.  So  according  to  (A8* ) » we  compute 


dl”  = gcd(lc(F),  I c (G ) ) * y . Then  we  set 

U <—  dl"  F,  D <-  dl"  (0  / 1 c (D  ))  - x and 

1 0 0 0 

H < I c (0  ) F /D  ■ x + 2,  but  now  we  know  that  the  mul- 

1 10  0 

tivariate  leading  coefficients  of  D and  H are  both  equal 

1  1 


to  y . So  we  can  apply  the  modified  multivariate  EZ  Algo- 


rithm ( i i -5.5)  to  U,  0 , H , with  y as  leading  coeffi- 

11 

cients.  This  problem  is  exactly  the  same  as  Example  II-5.5 

2  2 

and  the  results  of  that  are  0"UG')  - y x + yz-y  zand 

4 
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- y x + y - yz  + 2y  such  that  U - D"  H"  over  Z. 


Thus  ue  avoided  the  LCB  problem  as  pointed  out  by  Example 


2 3 2 

1 1-5.5  at  the  cost  of  computing  dlH  ■ gcdly  , y ) - y . Nou 


ue  note  that  F - pp (U)  - pp(Q")  pp(H")  - D H over  Z,  so  that 


D-yx  + z-  yz  and  H-yx  + y-  z +2.  Since  Dali 


2 2 2 

divides  G (G  » D (y  x - z x - y + 2)  over  Z),  ue  conclude 


that  0’  » D = gcd(F,  G)  * gcd(F',  G')  «yx  + z-  yz. 


Ill  - 5 Conclusions 


This  chapter  has  demonstrated  the  applicability  of  the 


Hensel  construction  to  the  computation  of  polynomial 


greatest  common  divisors,  especially  in  the  multivariate 


case.  A complete  GCD  algorithm  uas  presented  uhich  uses  the 


Hensel  construction  for  all  situations.  UN I GCD  Algorithm 


shoued  the  feasibilit.  * using  Zassenhaus'  Quadratic 


Extension  Algorithm  for  calculating  univariate  GCD’s,  hence 


avoiding  the  computation  of  many  modular  images  of  a GCD 


over  the  integers.  By  taking  advantage  of  the  sparseness 


usually  exhibited  by  multivariate  polynomials,  the  EZGCD 


Algorithm  achieves  greater  computational  efficiency  at  the 


very  small  cost  of  having  the  given  polynomials  satisfy  a 


quite  minor  condition  (Condition  III -A ) . Careful  and 


detailed  design  and  implementat;on  of  this  algorithm 


I 
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revealed  the  possibility  of  using  the  Hensel  construction 
for  other  fundamental  polynomial  operations.  These  include 
the  semi  -para  I lei  computation  o."  contents  and  primitive 
parts,  the  square-free  decompositions  of  polynomials,  and  a 
different  view  of  the  pseudo-division  of  polynomials  via  the 
solution  of  polynomial  equations. 


I 


■'■—T-  — ' 'S'" 


155 

CHAPTER  IV  COnPUTING  TIME  ANALYSIS  AND  EMPIRICAL  RESULTS 
IV  - 1 Introduction 

The  practical  value  of  an  algorithm  '19  measured  by  its 
"efficiency"  and  the  most  natural  quantitative  measure  of 
efficiency  i9  the  computing  time  of  the  algorithm  for  a 
particular  problem.  But  computing  times  do  not  always 
reveal  tha  whole  truth  because  of  the  many  varying  factors 
that  have  an  influence  on  their  values,  such  as  the  computer 
hardware,  the  supporting  software,  and  the  algorithm 
implementation  and  implementer.  Thus,  in  order  to  evaluate 
an  algorithm,  it  is  important  to  perform  some  theoretical 
analysis  of  the  algorithm  independent  of  the  implementation. 
In  analyzing  an  algorithm  one  9hould  strive  to  correlate 
predicted  performance  with  the  actual  computing  time,  so 
that  one  can  attempt  to  verify  the  validity  of  the  analysis 
and  the  correctness  of  the  implementation  simultaneously. 

Methods  of  analysis  which  attempt  to  achieve  these 
goals  have  been  developed  and  popularized  in  the  recent 
year 9 by  Knuth  [KNUS9I  and  (particularly  for  algebraic 
algorithms)  by  Collins  IC0LG9) , IC0L7L . and  Brown  IBR071) . 
Ue  will  briefly  discuss  some  of  these  methods  o.,d  point  out 
some  of  their  weaknesses.  Then  a somewhat  different  and 
more  intuitive  new  method  (developed  with  9ome  essential 
innovations  of  Moses)  will  be  proposed  for  analyzing  the 


- 


a »r»5  r.« 


jus t described  Hen3el  construction  and  the  EZGCO  Algorithm. 


Finally,  in  this  chapter,  we  will  show  how  well  the  analysis 


is  substantiated  by  some  empirical  test  data.  A similar 


philosophy  regarding  computing  time  analysis  was 


independently  arrived  at  by  Gentleman  CGEN731 . Fateman 


[FAY73]  also  presents  a similar  point  of  view  and 


criticizes  the  worst-case  analysis  of  algorithms  for 


asymptotically  large  problems  as  essentially  misleading.  He 


analyzes  several  algorithms  for  making  one  particular 


polynomial  computation  by  comparing  actual  timing  and  counts 


of  basic  operations  (additions,  multiplications,  etc.)  to 


the  total  run-times  of  the  implementations  for  these 


algor i thms. 


Although  both  time  and  space  requirements  of  an 


algorithm  are  important,  it  is  often  much  easier  to 


concentrate  on  one  aspect  rather  than  the  "two  dimensional" 


problem.  Ue  shall  concentrate  mainly  on  the  analysis  of  the 


computing  time  of  algorithms,  and  simply  neglect  space 


requirements  (or  assume  that  the  available  storage  space  is 


infinite). 


One  notation  used  in  the  analysis  of  algorithms  is  the 


"big-oh"  notation  [KNUG91 , which  has  the  following  meanlngt 


the  number  x(n)  represented  by  0 ( f (n) ) (big-oh  of  some 


function  of  n) , where  f(n)  is  a function  of  the  positive 


integer  n,  satisfies  the  condition  |x(n)|  < fl  |f(n)|  for 


- 
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some  positive  constant  H.  Unfortunately 
not  very  suitable  for  the  analysis  if  al 


this  0-notation  is 
gebraic  algor i thms. 


So  Collins  and  Broun  adopted  the  more  general  notion  of 


dominance.  Let  f and  g be  real -valued  functions  defined  on 
some  set  S.  Then  f is  said  to  be  dominated  by  g or  g bounds 
f,  if  there  is  a positive  real  number  c such  that  f(x)  < c 
g(x)  for  all  x in  S.  If  f is  also  a hound  for  g (f  bounds 
g) , then  f and  g are  said  to  be  codominant.  Note  the 
similarity  to  the  0-notation  but  the  generality  of  S,  or  x 
in  S.  makes  it  possible  for  f and  g to  be  functions  of 
several  variables.  It  is  clear  that  codominance  is  an 
equivalence  relation.  Husser  (MUS71,  Theorem  OJ  lists  some 
properties  of  the  dominance  relation  uhich  are  straight- 
foruard  consequences  of  the  definitions.  This  notion  of  a 
bound  is  useful  and  convenient  for  many  problems  but  it  also 
has  some  draubacks.  One  defect  is  that  the  real  constant  c 
(uhich  is  ordinarily  unknoun)  is  sometimes  necessarily  so 
large  that  the  bound  is  good  only  for  a small  portion  of  the 
x’s  in  S and  is  unrealistic  for  all  other  elements  of  S. 

Also,  since  the  definition  fcr  dominance  encompasses  all 
elements  x of  S,  a bound  for  a given  function  often  turns 
out  to  be  an  asymptotic  or  "uorst  case"  bound  and  provides 
little  information  about  the  actual  behavior  of  the  function 
for  the  "average"  cases.  These  phenomena  have  been 
mentioned  already  in  Section  1-2  uhere  ue  quoted  Broun’ s 


computing  time  formulas  for  the  Reduced  PRS  and  the  Modular 
GCD  Algorithms.  Nevertheless,  for  many  simpler  algorithms, 
this  method  of  analysis  still  provides  valuable  intuition 
and  ue  will  utilize  it  whenever  it  is  appropriate  for  our 
computing  time  analyses. 

Before  ue  proceed,  we  will  now  formalize  the  notions  of 
dense  and  sparse  polynomials.  A v-variable  polynomial 
having  maximum  degree  d in  the  ith  variable  will  be 

considered  dense  if  it  contains  all  the  possible  monomial 

terms  (products  of  pouers  of  variables)  uith  ith  degree  < d . 

i 

Thus,  a derise  univariate  polynomial  in  x of  degree  d ui  I I 

d 

have  all  pouers  of  x — l,x x , so  that  it  has  d+1  terms. 

A dense  v variable  polynomial  with  maximum  degree  d , then, 

i 


has 


v - 6,  the  polynomial  will  have  over  15,000  terms.  A pre- 
sent-day symbolic  computing  facility  can  hardly  perform  any 
non-trivial  algebraic  operations  on  such  a polynomial.  This 
is  an  important  objection  to  asymptotic  analyses  of  some 
algebraic  algorithms  and  the  reason  for  the  more  intuitive 
analysis  for  practically  sized  problems. 


159 


b 
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Any  polynomial  which  has  a significantly  smaller  number 
of  terms  than  that  of  the  dense  polynomial  having  the  same 
degree  in  each  of  the  variables  is  considered  sparse. 
According  to  this  description  of  sparseness,  the  dividing 
line  between  the  classes  of  sparse  and  dense  polynomials  is 
certainly  not  distinct.  It  ie  often  left  to  personal 
judgment  to  determine  what  is  a reasonable  percentage  of 
non-zero  terms  for  which  a polynomial  will  be  considered 
sparse. 

These  are  what  we  prefer  to  call  the  "autonomous" 
definitions  of  denseness  and  sparseness  in  that  they  can  be 
determined  by  knowing  the  polynomial  itself.  Another 
"relative"  definition,  originally  used  in  relation  to 
polynomial  multiplications,  is  more  preferable  in  analyzing 
in  the  Hensel-type  algorithms.  The  two  different  points  of 
view  are  actually  very  closely  related  as  we  will  soon 
i I lustrate. 

Two  polynomials  are  considered  completelu  sparse  if, 
when  we  form  their  product,  there  is  no  combination  of  terms 
by  addition  at  all.  Thus  if  one  polynomial  has  m terms  and 
the  other  n terns,  then  their  product  u II  have  mn  terms. 

Two  polynomials  of  m and  n terms  respectively  will  be 
considered  dense  if  their  product  has  m+n-1  terms.  As 
pointed  out  by  Johnson  and  Gentleman  [J4G73I,  this  can  only 
be  attained  when  the  polynomials  are  univariate  with  no  non- 


ji 
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zero  coefficients.  For  autonomously  dense  multivariate 
polynomials  of  degree  d in  each  of  the  v variables,  the 
product  has  (2  d + l)**v  terms  but  the  computation  of  +he 
product  by  this  usual  algorithm  involves  (d  + 1)**(2  v) 
term-by-term  multiplications.  Thus  the  ratio  of  non- 
cancel led  terms  to  the  possible  terms  is  approximately 
(2/ td  + l))**v.  So  we  extend  the  definition  of  relative 
denseness  to  equal-degreed  (having  the  same  maximum  degree  d 
in  each  variable)  multivariate  polynomials  of  m and  n terms 
respectively  which  yield  a product  of  k terms  such  that 
k/(m  n)  < (2/ (d  + l))**v. 

Among  the  basic  arithmetic  operations  for  polynomials, 
multiplication  turns  out  to  be  the  most  important  and 
relevant,  since  addition  is  much  less  costly  for  oi»r 
polynomial  representation.  The  classical  method  for 
performing  the  multiplication  of  two  polynomials  P and  Q to 
get  their  product  R (■  P Q)  involves  term-by-term  multiplica 
tions.  So,  if  T(P)  denotes  the  number  of  terms  in  a poly 
nomial  P,  then  the  cost  of  the  polynomial  multiplication, 

R <—  P Q,  is  at  least  T (P)T(Q) . In  fact,  this  is  the  often 
accepted  computing  cost  for  multiplication  (e.g.  by  Brown 
(BR0711,  Johnson,  and  Gentleman  IJSG73I),  especially  for  the 
extreme  cases  where  the  polynomials  involved  are  completely 
sparse  or  dense.  For  the  completely  sparse  case, 

T(P)  T (Q)  - T (R)  which  is  the  actual  computing  time  often 
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observed  for  sparse  polynomials.  Once  these  terms  are 
generated,  it  is  necessary  to  sort  them  into  some  particular 
canonically  simpl i tied  representations.  Thus,  including  the 
sorting,  polynomial  multiplication  is  bounded  by 
T (P)T(Q) log(T(P)T(Qi).  But  that  assumes  no  ordering  on  the 
terms  in  P or  Q.  Uhen  the  ordering  of,  say,  P is  taken  into 
account,  the  bound  for  multiplication  of  sparse  polynomials 
can  be  reduced  to  T (P) T (Q)  I og (T (P) ) (as  ALTRAN  has  been  able 
to  accomplish).  It,  then,  seems  reasonable  to  expect  the 
computing  time  of  a multiplication  algorithm  to  be  propor 
tional  to  T(P)T(Q)  by  taking  advantages  of  the  orderings 
from  both  input  polynomials  (see  IFRO001  for  relevant 
discussions).  In  fact,  such  an  improvement  can  be 
accomplished  for  the  dense  case.  If  the  product  is  dense 
the  classical  method  is  not  the  best  uay  for  this 
computation.  The  reason  is  that  terms  which  combine  do  not 
have  to  be  computed  separately.  The  technique  of  using  the 
fast  finite  Fourier  transform  to  do  multiplication  turns  out 
to  achieve  the  (presently)  minimal  computing  time  bound  of 
T (R)  I og  (T  (R) ) [BON733  . Ue  see  that  when  the  po  I ynor,i  i a I s 
involved  are  all  dense,  it  is  possible  to  use  the  above 
lower  bound  which  in  general  is  already  bounded  by  T(P)T(Q) 
(since  (2d+l)**v*logt '(2d+l)**v)  s in  general  less  than 
(d+l)**(2  v)  for  v > l).  An  even  stronger  iirgument  can  be 
made  for  this  when  the  algorithm  being  analyzed  performs 
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polynomial  multiplications  only  as  a part  of  all  the 
necessary  polynomial  operations.  Ue  agree  completely  uith 
those  who  argue  that  the  cost  of  multiplication  is  at  least 
T(P)T(Q)  uhen  an  algorithm  is  used  only  for  multiplication 
or  is  dominated  by  some  multiplication  process.  Thus  it 
9eems  reasonable  to  use  T (P) T (Q)  as  the  estimate  for  the 
cost  of  multiplication  for  all  cases,  except  possibly  uhen 
the  product  is  dense  uhere  ue  can  use  T(R)  I og (HR) ) as  the 
bound.  As  already  noted  in  Chapter  I,  multivariate 
polynomials  of  high  degree  are  inevitably  sparse  in 
practical  situations,  in  fact,  exceedingly  sparse,  so  that 
the  T(P)T(Q)  computing  cost  uill  be  a fair  measure  for  a 
uide  range  of  practical  situations.  The  correctness  of 
using  this  formula  for  multiplication,  houever,  uill  be 
confirmed  by  many  empirical  computing  results  in  Section  IV- 
3. 


E 


IV  - 2 Analysis  of  Computing  Costs 

In  this  section,  ue  uill  attempt  to  give  some 
reasonable  estimates  of  computing  costs  for  the  algorithms 
discussed  in  the  previous  two  chapters.  Many  of  the 
asymptotic  computing  time9  for  common  arithmetic  operations, 
especially  for  dense  univariate  polynomials,  uill  be  based 
or.  Broun' s analysis  [BR071I,  Ue  uill  often  assume 
univariate  polynomials  to  be  dense  and  use  the  asymptotic 
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computing  times  for  many  operations  on  them.  But  we  will 
also  discuss  computing  times  for  average  sized,  non-dense 
multivariate  polynomials  since  dense  polynomials  uith  many 
variables  of  high  degree  seldom  occur  in  practical 
computations. 

The  computing  time  for  the  Extended  Euclidean  Algorithm 
(algorithm  1 1 -2.1)  in  (Z/q)  [x]  whon  q is  bounded  by  some 
fixed  length  integer  is  of  the  sane  order  as  the  computing 
time  for  the  Euclidean  algorithm,  so  that  it  is  bounded  by 
deg (G)  (deg (F)  - deg(O)).  1 he  reason  for  this  is  clear  from 
the  fact  that  the  A and  B produced  by  the  algorithm  are 
obtained  by  multiplications  where  the  final  products,  A and 
B,  satisfy  the  conditions  deg(A)  < deg(G),  deg(B)  < deg(F). 

For  Algorithm  1 1-2.2,  there  are  essentially  three  poly- 
nomial multiplications  (A’  C),  (B’  C) . (Q  F)  and  a division 
with  remainder  A’  C ■ Q G + R.  Their  costs  are  bounded  by 
deg(G)  (deg(H)  - deg(O)  + 1),  deg(F)  (deg(H)  - deg(O)  + 1), 
(deg (HI  - deg (0)1  (deg (F)  + 1),  and 
(deg  (HI  - deg  (Dll  (deg(G)  + 1),  respectively.  The 
approximate  total  cost  is,  then 

2 (deg(F)  + deg (G)  + 1)  (deg(H)  - deg(DI)  + deg(F)  + deg(G). 

For  the  univariate  Zassenhaus’  Quadratic  Extension  Algo- 
rithm 1 1-3.2:  Step  (1)  costs  deg(G)  deg(H):  Step  (2)  and  Step 
(4)  are  bounded  by  T (G)  T (HI : in  Steps  (3)  and  (Si,  since 
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deg (C)  and  deg (R ) < deg(G)  + deg(H)  and  deg(D)  • 0,  the 
costs  are  bounded  by 

2 

2 (deg (G)  + deg (H) ) (deg(G)  + deg (H) ) + deg  (H)  or 
2 2 

(deg (G)  + deg(H))  or  simply  deg  (F).  Since  Steps  (2)  — (5) 

are  carried  out  k times  and  assume  T (G)  - T(G  ), 

0 

T(H)  ■ T(H  ) for  each  step,  then  the  total  cost  iB  appro- 
0 


2 

ximately  k (TIG  ) T (H  ) +2  deg  (F)).  For  dense  univariate 
0 0 

polynomials  T(F)  - deg(F)  + 1,  hence  the  total  cost  is 

2 2 

e33entiai  y dominated  by  k deg  (F)  or  k d if  deg(F)  - d. 
Note  that  here  we  have  not  considered  the  integer  length  of 
the  coefficients  in  F.  This  detail  was  omitted  on  the 
assumption  that  the  coefficients  occuring  in  F for  practical 
cases  can  all  be  expressed  by,  say,  at  most  four  3G  bit 
binary  computer  words.  In  fact,  if  we  have  infinite  preci- 
sion integral  coefficients  the  cost  for  this  algorithm  '13 
still  just  a maximum  integer  length  multiple  of  the  above 
(for  instance,  see  IMUS71]). 

The  Hensel’s  Algorithm  (II-3.1)  is  just  a simpler  and 
shorter  version  of  Algorithm  II-3.2.  Except  the  m ber  of 

k 

steps  is  k’  where  k - log(k')  or  k*  ■ 2 . Thus  the  cost  of 


the  Hensel’s  Algorithm  is  k’  (T  CG  ) T(H  ) +2  deg  (F)  J . 

1 1 

For  the  analysis  of  the  Generalized  Hensel  Algorithm 
1 1 -4.1,  it  is  necessary  to  first  consider  the  computing 
cost  of  the  substitution 

p’  (y y > <--  Ply  +b y +b  ).  Let  d , i - 1 v 

1 v 1 1 v v i 

be  the  maximum  degrees  of  variables  y in  Ply  ,..,,y  ). 

1 1 v 

r J 

For  each  i,  the  computation  of  (y  + b ) for  j - 1 d 

i i 

requires  2(2  + 3 + ...  + d ) - (d  - 1)  (d  + 2)  term  by 


term  multiplications.  Thus 


^^(d  - 1)  (d 


+ 2)  term 


operations  are  needed  for  computing  all  the  powers  for  the 

substitutions  of  the  variables.  Assume  now  the  polynomial 

P is  put  in  fully  expanded  form  in  J[y  ,...,y  ].  Then 

1 v 

k k k 

1 2 v 

a typical  monomial  is  y y ...  y and  the  cost  for  mul- 

1 2 v 


tiplying  the  k degree  dense  polynomials  of  (y  + b ) al- 
i i I 
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ready  prepared 


is  | P Ik  + 1). 


If  the  fully  expanded  poly- 


1-1 


nomial  P cont  ns  TIP)  terms  or  monomials,  then  the  total 

cost  of  subst.  v ,.;iion  to  get  P*  Is  (»)  

v 


TIP)  v 

£TT*„ 

j-1  i«l 


E 


Ik  + 1)  + / Id  - 1)  Id  +2).  INote  that  a 
i i 

i-1 


brief  analysis  of  substitution  of  this  type  with  Horner’3 
rule  not  only  shous  the  same  order  but  also  reveals  its  in- 
efficiencies in  redundant  operations  for  these  multiple  sub- 
stitutions. In  a private  communication  uith  E.  Horowitz, 
the  at ove  statement  was  also  verified  by  his  computing 
time  formulas  also  IH0R73I.) 

If  n is  the  maximum  term  degree  of  P in  the  variables 

y .....  y , then  n bounds  all  k + ...  + k , so  that  the 
1 v i j vj 

total  cost  of  substitution  is  bounded  by 

v 2 2 2 

T IP)  (n/v)  + d + d + . . . + d . 

1 2 v 


If  P is  a dense  polynomial  of  degree  d in  each  variable, 


then  the  cost  bound  becomes 


E-ETTv1'  TTv 


k -0  k -0  i«l 
1 v 


U id  + 1)  id  + 2)/2  +yy  d . 


or  approx  I mate  I Vj 


2v  v 2 

d /2  + v d which  is  roughly  the  cost  of  multiplying  P 
by  itself. 

It  is  important  to  point  out  that  the  number  of  terms 

in  P’  could  become  as  large  as  a dense  polynomial  of  degree 

d , ...  , d which  could  be  much  larger  than  TIP).  Hence, 

1 v 

not  only  may  the  computing  cost  of  substitution  be  expensive, 
but  also  this  type  of  blouup  in  the  number  of  terms  may 
cause  computational  inefficiencies  in  some  unfortunate  cases, 
as  we  will  see  later  in  this  chapter.  Therefore,  it  is  im- 
portant to  choose  as  many  zeros  in  the  set  of  substitution 

values  b » lb  , ...  , b I whenever  possible.  In  the  expres- 
1 v 

sion  (*) , if  any  b - 0 then  us  can  accordingly  set  k to 
i’  i ’ j 

0 and  d to  1 to  indicate  that  essentially  no  work  is  invol- 


ved for  this  variable.  The  result  ini  savings  in  computing 
cost  for  the  Generalized  Hensel  Algorithm  is  obviously  great 
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for  each  additional  zero  b . This  blowup  problem  due  to  non- 

i 

zero  substitutions  will  be  called  the  non-zero-substi tut  ion 
problem  and  will  be  discussed  later  for  aome  individual 
algor i thms. 

Let  the  input  polynomial  F of  the  Generalized  Hensel 

Algorithm  have  d , i - l,...,v,  as  the  maximum  degree  In  the 

i 

non-main  variable  y . Then  Step  (1)  of  the  GHA  ie  the 
i 

substitution  operation  whose  cost  is  bounded  by 
v 2 2 

T (F)  (n/v)  + d + ...  + d , if  we  assume  essentially  no 

1 v 

additional  cost  for  multiplying  a coefficient  polynomial  In 

x,  C(x),  with  a monomial  in  the  other  variables  as  compared 

with  multiplying  a constant  and  monomial.  The  cost  of  Step 

(2)  is  deg(G  ) deg(H  ).  For  Step  (3),  we  observe  that  the 

1 1 

I 

multiplication  of  an  univariate  polynomial  in  x by  x in- 
volves essentially  a shift  operation  or  additions  of  the 
degrees  by  a constant,  i.  Thus,  in  the  deg(F)  applications 
of  Algorithm  1 1-2.2,  only  the  division  steps  and  one  multi- 
plication need  to  be  counted.  The  cost  of  these  operations 
deg(F) 

UdeglH  ) + i)  degIH  ) + i degIG  )]  - 
1 1 1 
i-1 
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deg(F)  deg  (H  ) + deg(F)  (deg (F) (deg (F)  + l)/2)  or  approx i- 
1 

3 

mately  bounded  by  deg  (F)/2.  Because  of  the  iterative  update 
of  D,  carrying  out  Step  (5)  up  to  n times  constitutes  a mul- 
tiplication of  G and  H , hence  the  total  cost  is 
n n 

T (G  ) T (H  ) . Step  (4)  has  negligible  cost  and  Step  (G)  only 
n n 

involves  "scalar"  multiplications.  Step  (7)  is  again  a 
substitution  step  for  polynomials  smaller  in  size  than  F in 
Step  (1),  hence  it  is  bounded  by 
v v 

T(G  ) (n/v)  + T(H  ) (n/v)  . In  fact  the  total  cost  of  the 

n n 

Generalized  Hensel  Algorithm  is  bounded  by 

v 2 2 3 

(T(F)  + T (G  ) + T(H  ))  (n/v)  + d + ...  + d + deg  (F)/2  + 
n n 1 v 

T(G  ) T(H  ) or  simply  by 
n n 

v 

T (G  ) T (H  ) + (T (F)  + T(G  ) ♦ T(H  ))  (n/v)  (El). 

n n n n 

For  the  Univariate  EZ  Algorithm  11-5.3,  there  is  one 
additional  cost  to  the  cost  of  applying  Zassenhaus’  Quadra- 
tic Extension  Algorithm.  That  is  the  computing  cost  of 
cont(H)  and  pp(H)  which  essentially  involves  at  most  deg(H) 
integral  GCO  computations  with  each  bounded  by  the  maximum 


integral  length  squared.  For  fixed  length  integers  th i 3 is 
a cost  depending  essentially  only  on  deg(H).  Houever,  in 
the  case  of  Multivariate  FZ  Algorithm,  the  situation  is 
someuhat  different.  Not  only  do  cont(H)  and  pp(H)  consti- 
tute additional  costs  to  that  of  GHA,  but  also  the  cost  of 
multivariate  division  and  multiplication  for  computing 


H and  G mod  S needs  to  be  counted.  Houever,  Lemma 
II  - 5.5  shous  that  it  is  possible  to  avoid  this  expense  at 
the  cost  of  predetermining  leading  coefficients  of  the  divi- 
sors. So  ue  uill  delay  analysis  of  this  problem  until  later 
uhen  ue  discuss  practical  applications  of  the  principles  of 
Lemma  II  - 5.5.  Furthermore,  the  follouing  observations 
uill  decrease  the  severity  of  this  problem. 

The  above  analysis  for  the  GHA  is  based  essentially  on 
the  sum  of  the  dominating  costs  for  each  individual  step. 
From  the  expression  (El)  ue  find  that  the  costs  for 
substitutions  and  multiplications  seem  to  be  the  most 
significant.  But  the  difficulty  in  using  this  expression 
for  an  estimate  of  computing  cost  lies  in  the  fact  that  the 
basic  operations  of  substitution  and  multiplication  are 
different,  so  that  their  basic  unit  costs  are  also 
different.  In  the  manipulation  of  multivariate  polynomials, 
there  can  be  many  different  basic  operations  that  a complete 
timing  analysis  should  include.  But  these  operations  often 


I 


have  varying  cost  units,  unlike  many  algorithmic  analyses  in 
complexity  theory  where  the  most  costly  basic  operations  are 
often  either  a comparison  of  two  records  or  a multiplication 
of  two  fixed  point  integers.  It  is  then  clearly 
advantageous  to  find  some  particular  parameters,  for  an 
algorithm  involving  many  different  basic  operations,  on 
whirn  essentially  all  relevant  computing  costs  depend 
somewhat  explicitly.  For  this  basic  reason  as  well  as  the 
very  sound  argument  that  the  following  variation  of  cost 
analysis  can  be  more  easily  substantiated  by  actual  run 
times,  we  propose  a new  formula  for  the  total  proportional 
cost  of  the  GHA.  The  main  parameter  for  the  computing  coet 
formula,  which  we  will  attempt  to  establish  by  the  following 
arguments  and  actual  computing  timee,  is  the  number  of  terms 
for  the  polynomials. 


Let  F’  denote  F(x,y  +b  ,...,y  +b  ).  Since  the  most 
11  v v 


important  operations  for  the  GHA  is  to  correct  for  terms  in 


R - F’  - G H at  the  mth  iteration,  after  n iteratione 
m mm 


the  algorithm  would  have  corrected  for  all  the  croee  terms 


(1)  In-1) 

of  G H with  G ■ G + G + ...  + G and 
n n n 1 


(1)  (n-1) 

H-H+H  +...+H  .For  each  dietinct  term  in 
n 1 


i 


1 


i 
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this  product,  some  work  is  involved.  That  work  includes  the 
multiplication  that  produced  the  term;  isolating  this  term 
by  finding  the  correct  integral  coefficient,  the  power  of  x, 
and  the  monomial  in  the  other  variables;  solving  the  corres- 
ponding DUPE  for  the  term;  and  finally  assembling  the  solu- 
tions of  the  DUPE  uith  the  monomial  and  adding  the  results 
to  the  two  current  codivisors.  Quite  clearly,  all  this 
work  for  one  term  dominates  in  cost  over  any  other  basic 
operations  such  as  sorting,  comparing,  multiplying  or 
dividing  by  integers,  etc.  In  fart,  even  the  content  and 
primitive  part  computations  in  the  restore-leading-coeffi- 
cient operations  correspond  roughly  to  these  individual 
terms.  Thus  it  is  quite  reasonable  to  expect  that  the  total 
computing  cost  for  GHA  is  proportional  to  the  number  of 
distinct  terms  that  the  algorithm  has  to  deal  with,  and  that 


is  approximately  (or  at  least  bounded  by)  T(G  ) T(H  ), 

n n 


except  when  F'  is  dense.  In  such  a case  F'  would  contain 


all  possible  terms  that  the  product  of  G and  H could  gene- 

n n 


rate.  In  this  case,  T(r')  is  the  number  of  terms  that  GHA 
has  to  deal  with.  Therefore,  the  total  cost  of  GHA  should 


be  approximately  proportional  to  (TIG  ) T(H  ) or 

n n 


nr’))  + cost  of  substitution.  Actually  the  substitution 
cost  can  generally  be  included  in  the  first  part  of  the 





above  expression  Bince  i t is  (T(F)  + TIG  ) T(H  ))  (n/v)**v. 

n n 

Clearly,  if  F’  is  dense,  then  T(F)  < T(F’),  T(G  ) < T(F’), 

n 

and  T (H  ) < T(F');  whereas  if 
n 

max(T(G  ) T (H  ),  T (F* ) ) - T(G  I TtH  ) this  T (G  ) T(H  ) still 
n n n n n n 

dominates  ( T (F)  + T(G  ) + T (H  ))  in  general.  Therefore,  we 

n n 

can  even  go  further  and  simply  claim  that  the  coet  of  GHA  is 

proportional  to  T (G  ) T (H  )©T(F’)  (E2). 

n n 

The  ”@r  is  used  here  in  the  special  sense  that  the  total 

cost  is  proportional  to  T (G  ) T (H  ) in  general  except 

n n 

when  F’  is  dense,  then  it  is  mainly  proportional  to  T(F’). 

There  is  one  qualification  necessary  for  the  dense  case. 

Even  though  T(F’)  is  the  main  parameter  which  many  basic 

operation  costs  in  GHA  are  proportional  to,  the  classical 

term-by-term  multiplication  cost  which  is  on  the  order  of 

T (G  ) T (H  ) and  is  non-linear  with  respect  to  T(F’)  will 
n n 

begin  to  take  a significant  part  of  the  total  GHA  computing 

cost  if  G and  H are  also  quite  dense  and  have  a largo 
n n 

number  of  terms.  This  means  that  as  the  problem  size  in- 
creases in  the  dense  case,  the  multiplication  cost  can  begin 
to  dominate  the  costs  of  other  operations.  For  example,  in 
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Case  2 of  the  empirical  test  data  of  the  next  section  ue 
explicitly  recordef  the  time  for  the  Hensel  constructions  on 
linearly  dense  quartic  polynomials  F and  the  related  multi- 
plication time  (in  seconds): 


V 

- 1 

2 

3 

4 

5 

nui tp. 

0.10 

0.27 

0.59 

1.15 

2.10 

Hensel 

0.4G 

0.76 

1.35 

2.28 

3.80 

% 

22% 

34% 

43% 

50% 

55% 

Ue  notf  that  the  percentage  of  computing  time  for  multipli- 
cation in  the  GHA  in  this  problem  is  increasing  as  the 
numbur  of  variables  increases.  But  as  ue  will  see  from  the 
"ratio  tests"  in  the  next  section  the  linear  behavior  u.r.t. 
T(F’)  still  dominates  over  the  quadratic  contributions  from 

the  multiplication  of  G and  H , Similar  recordings  of 

n n 

multiplication  times  and  total  GHA  time  uere  also  made  for 
other  cases  of  empirical  test  data  in  the  next  section.  For 
the  non-dense  cases,  the  percentage  of  multiplication  time 
has  never  exceeded  20%  of  the  total  GHA  time;  whereas  the 
dense  cases  show  a upper  bound  of  60%.  At  the  point  of  60%, 
houever,  the  test  problems  involve  polynomials  of  more  than 
1875  terms  uhich  are  quite  large  as  far  as  the  polynomials 
that  can  reasonably  be  handled  by  a present-day  symbolic 
system  are  concerned.  Thus,  for  the  class  of  problems  where 
the  polynomial  sizes  are  not  extremely  large  so  as  to 
prohibit  computation  in  present-day  symbolic  systems, 
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formula  (E2)  gives  a good  estimate  of  the  work  involved  to  a 
reliable  accuracy  even  for  the  dense  case).  Furthermore,  we 
repeat,  large  multivariate  polynomials  are  inevitably 
sparse.  5u  that  our  computing  cost  estimate  fo’  ?HA  by  (E2) 
will  hold  for  quite  a wide  range  of  problems. 

Ue  now  make  estimates  of  the  C09t  of  computing  CCD’s  of 
multivariate  polynomials  using  the  EZGCO  Algorithm  under  the 
following  assumptions: 

(ASM1 ) The  integer  coefficients  of  the  input  polynomials 
and  the  resulting  polynomials  after  solution  with 
the  chosen  b-values  are  small  (w.r.t.  the  maximum 
single  precision  integers  in  most  present-day  com- 
puters) so  that  the  total  computing  time  is  not 
significantly  dependent  on  the  coefficient  sizes  or 
the  time  for  arithmetical  operations  therof.  (See 
(A3)  and  (A4)  of  IBR0711). 

(ASf12)  Unlucl'y  primes  and  evaluation  values  are  never  used 
in  the  Hen9el  constructions  (i.e.  Step  (A4)  of  the 
EZGCO  Algorithm  ensures  Condition  IH-B  or  a lucky 
evaluation  for  later  steps).  (See  (A2)  of  EBR0713 ) . 
(ASM3)  The  cost  of  contend  computations  i9  negligible  com- 
pared to  the  actual  cost  of  computing  the  GCO  of  the 
primitive  parts  of  input  polynomials. 

(ASM4)  There  is  no  need  to  use  the  special  case  algorithm 
(SI)  - (S7)  (i.e.  Condition  1 1 1 -A  holds). 


>-  i..  ...  ...... 


Since  Steps  (Al)  - (AS)  are  simply  more  detailed 

statements  of  Steps  I - /of  the  outline  of  the  EZGCO 

Algorithm,  in  Section  1 1 1-2,  we  nil  I examine  these  five 
« 

steps.  Assumption  (ASM3)  basically  states  that  the  cost  of 
Step  I is  negligible.  In  fact,  it  assumes  that  the  cost  of 
GCD  computations  with  one  less  variable  is  negligible.  That 
includes  computing  d*  ■ gcd ( f * , g’)  and  dl"  ■ gcd(lc(F), 

I c (G) ) as  uell  as  the  restore-leading-coef f iciert 
operations.  This  is  in  general  not  correct.  Practical 
experiences  tell  us  that  the  totality  of  all  these  GCD’s  of 
one  less  variable  could  in  certain  cases  amount  to  as  much 
as  one-third  the  total  cost.  But  we  make  this  assumption  on 
the  grounds  that  the  problem  of  computing  the  GCD  of 
primitive  polynomials  merits  special  attention  so  that  f’. 
g',  and  d’  can  all  be  assumed  to  be  1,  that  I c (F)  and  I c (G> 
are  in  most  cases  much  smaller  than  F and  G,  and  that  for 
GCD  problems  the  modified  Step  (A8’)  of  Section  III -5 (a) 
helps  to  reduce  the  cost  of  the  restore-leading-coefficient 
operation  to  the  very  minimal.  Also  the  computation  of 
GCD’s  of  several  polynomials  together  will  be  discussed 
further  in  Chapter  VI. 

Steps  II  and  III  involves  only  evaluations  and 
univariate  GCO’s  which  usually  cost  much  less  than  the  other 
steps.  (ASMl)  in  effect  states  that  computing  univariate 
GCD’s  is  cheap.  Siep  V consists  of  only  one  test-division. 


-■  
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Although  the  cost  of  dividing  tuo  polynomials  by  the 
classical  method  is  asymptotically  bigger  than  the  cost  ot 
computing  GCD’s  for  polynomials  of  the  same  sizes,  in  nost 
practical  cases,  divisions  are  much  cheaper  than  GCO 
calculations.  Alternatively,  ue  can  perform  tuo  parallel 
Hensel  constructions  on  both  F and  G so  that  the  total  _ost 
will  simply  be  multiplied  by  a constant,  2,  and  the  test- 
division  can  be  eliminated  by  simply  comparing  the  resulting 
potential  GCO's  from  tiie  tuo  Hensel  constructions.  Actually 
(ASM2)  assumes  auay  the  need  for  performing ‘the  test- 
division. 

Therefore,  based  on  these  assumptions  the  most 
significant  amount  of  time  of  the  entire  algorithm  ib  spent 
in  Step  (A8)  or  (A8’)  - the  Hensel  construction.  According 
to  the  analysis  already  done  above  and  assuming  ue  use  the 
alternative  Step  (A8’l,  the  cost  of  that  is  (according  to 
(El)) 

T C sub ( (d I " /d I ) 0)  T(sub(dl  H) ) + 

v 

(T (dl"  U)  + T(sub( (dP/dUD)  + T(sub(dl  H) )]  (n/v)  (E3) 

uhere  D = gcd(F,  G) , dl  - I c (D) . dl"  ■ gcd(lc(F),  lc(G)), 

U » either  F or  G,  "sub"  denotes  the  substitution 
transformation  discussed  in  1 1-4,  T denotes  tha  number  of 
terms  in  fully  expanded  form,  and  n is  maximum  term  degree 
of  dl"  U. 


w.-,. - ..  ...... . , 
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This  formula  seems  very  complicated  so  that  ue  may  uant 
to  make  some  further  simplifying  assumptions.  First  ue 
assume 

(AStlS)  Sufficiently  many  zeros  can  be  used  in  the  valid 
lucky  evaluation  for  Hensel  construction  so  as  to 
keep  the  number  o*  terms  in  the  substitution  trans- 
formed polynomials  approximately  the  eame  as  that  of 
the  untransformed  polynomials. 

This  assumption  has  the  wide  ranging  implication  that  makes 
'he  substitution  cost  negligible  compared  uith  the  other 
costs,  in  addition  to  making  T (sub (P) ) » TIP).  Thus  formula 
(E3)  becomes  simply 

T((dl"/d')  0)  T (dl  H)  (E4) 

If  in  addition  ue  assume: 

(ASH'S)  There  is  no  appreciable  increase  in  cost  of  the  EZ 
Algorithm  due  to  non-constant  leading  coefficients, 
so  that  Step  <A8)  can  be  used  instead  of  (AS’ ) and 
still  avoid  the  leading  coefficient  blouup  problem. 

Then  formula  (E4)  becomes  T(0)  T (H)  (E5) 

Note  that  this  assumption  is  stated  more  strongly  than 
necessary  for  formula  (E5)  to  hold.  Actually,  so  long  as 
the  LCB  problem  is  not  very  severe  (i.e.  the  unuanted  unit 
multipliers  u ard  1/u  as  discussed  in  Section  1 1 -5 (a)  and 
here  equal  to  i/dl  and  dl  respectively  are  small  mod  S , and 
only  cause  minor  increases  in  the  number  of  terms),  ue  can 


still  use  (A8)  instead  of  (A8‘)  of  the  EZGCD  Algorithm. 
Anyway,  (ASN3)  assumes  that  the  cost  for  restoring  leading 
coefficients  is  small,  especially  when  the  content  or  the 
multiplier  is  smal  I . 

If  assumptirn  (ASMS)  is  made  separately  from  .ASf15) , 
then  the  cost  for  substitution  transformation  still  remains 
and  formula  (E3)  becomes 
T ( sub (□) ) T (sub (H) ) 


+ IT (U)  + T (sub(O) ) + T (sub(H))]  (n/v) 


where  n is  the  maximum  term  degree  of  U. 

On  the  other  hand,  if  we  make  the  same  intuitive 
arguments  as  we  did  for  the  GHA  above,  and  reason  that  in 
formula  (E3)  substitution  cost  can  be  proportionally 
included  or  simply  use  the  formula  (E2)  instead  of  (El)  for 
the  proportional  cost  of  GHA,  then  we  have  a proportional 
cost  for  EZGCD  Algorithm: 

T'sub((dl’7dl)  0)  T (subldl  H))  © T(sub(dl”  U) ) (E7) 

under  only  assumptions  (ASfll)  - (A3f14).  That  is  the  cost  of 
EZGCD  Algorithm  is  proportional  to 

T ( sub ( (d I ”/d I ) D)  T ( sub ( d t H)),  except  when  d I " U is  dense, 
then  it  is  proportional  to  T (subldl11  U) ) until  the  problems 
become  so  large  that  multiplication  costs  begin  to  be 
equally  significant  to  the  total  costs,  then  (E7)  is  taken 


as  an  actual  sum  so  that  both  terms  contribute  to  the  total 


— ■ 


cost. 


The  above  formula  reveals  a computing  cost  of  polyno 
mial  GCD's  uhich  i9  essentially  proportional  to  the  number 
of  terms  of  the  polynomials.  Specifically,  the  cost  is 
proportional  to  the  number  of  terms  of  the  polynomial  used 
for  the  Hensel  construction  in  case  it  ie  dense,  otherwise, 
proportional  to  the  product  of  the  number  of  terms  in  the 
codivisors  obtained  by  the  Hensel  construction. 

It  is  important  to  point  out  that  the  cost  estimation 
formula  involves  both  the  given  polynomials  of  the  GCD 
problem  and  the  answers  (the  GCD  and  cofactors).  Sometimes, 
such  an  expression  can  have  only  questionable  value. 

Houever,  ue  first  emphasize  that  what  our  formula  reveals  is 
the  clear  fact  that  polynomial  GCD  computation  (and  probably 
many  other  polynomial  computations)  is  an  input  and  output 
dependent  process.  Knowing  only  the  given  inputs  simply 
will  not  allow  a good  estimate  of  the  computing  costs, 
except  possibly  in  certain  special  cases.  In  our  situation, 
the  nearly  completely  sparse  polynomial  is  such  a special 
case  where,  because  T(P  Q)  - T (P)  T(Q)  in  spar  se  cases,  the 
input  polynomials  alone  can  provide  sufficient  information 
to  estimate  the  computing  coet.  In  fact,  the  computing  cost 
is  linearly  proportional  to  the  number  of  terms  in  the 
polynomial  used  for  the  Hensel  construction.  This 
constitutes  a great  advantage  over  the  Nodular  GCD  Algorithm 
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uhen  the  given  polynomials  are  sparse  (and  the 
transformations  necessary  for  the  Hensel  construction  do  not 
blou  up  the  number  of  terms  severly).  For  the  other  extreme 
- the  dense  case,  the  situation  is  more  complicated.  So 
long  as  the  multiplication  cost  doe3  not  dominate,  input 
polynomials  still  provide  enough  information  to  estimate  the 
actual  computing  cost,  as  data  of  cases  2 and  5 will  show  in 
the  next  section.  In  Case  5,  the  biggest  dense  quadratic 
bolynomials  (in  4 variables  only),  for  uhich  ue  dared  to 
attempt  a GCO  computation,  has  1875  terms.  But  this  fact 
makes  clear  why  ue  can  not  and  should  not  consider 
operations  on  large  dense  multivariate  polynomials  (yet)  and 
uhy  asymptotic  analysis  of  algorithms  may  be  misleading  for 
realistic  problems  uhich  are  uithin  the  grasp  of  present-day 


computation. 

Next  ue  emphasize  a previously  neglected  point  in 
computing  time  analysis.  One  of  the  most  important  purpose 
of  performing  computing  time  analysis  for  an  algorithm  is  to 
account  for  the  major  effort  put  into  the  computation,  and 
one  hopes,  predict  the  cost  for  similar  computations  in  the 
future.  This  goal,  if  achieved,  is  significant  in  many 
obvious  uays.  Not  only  can  one  reveal  the  asymptotic 
behavior  of  the  algorithm  and  the  correctness  of  the  program 
implementation,  but  one  can  also  estimate  future 
computational  efforts  so  that  no  uasteful  or  impossible 
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computations  are  attempted  and  all  limitations  of  the 
algorithm  can  be  realized.  Of  course,  comparisons  of 
different  or  similar  algorithms  then  become  more  meaningful 
also.  Uhat  we  uill  show  uith  test  cases  in  the  next  section 
is  that,  for  a particular  class  of  problems  whose  general 
structures  are  knoun,  having  computed  the  GCO  and  cofactors 
of  a simpler  problem,  our  cost  estimating  formula  enables  us 
to  predict  the  cost  or  the  computing  time  of  a more  complex 
problem  in  the  same  class  to  an  accuracy  that  is 
surprisingly  good  especially  as  the  problems  get  large. 

That  is  exactly  the  point  and  the  importance  of  what  we  call 
the  "ratio  test". 

IV  - 3 Empirical  Computing  Results 

Uith  the  analytical  formulas  for  costs  of  computing 
GCD’s  by  the  EZGCD  Algorithm  based  on  the  analysis  of  the 
Generalized  Hensel  Algorithm,  we  now  present  several  tables 
of  empirically  observed  computing  times  (in  seconds)  for  the 
Reduced,  Modular,  and  the  EZGCD  Algorithms.  Uith  these 
actual  run  times,  we  hope  to  show  not  only  the  performances 
of  these  algorithms  foi  various  classes  of  problems  but  also 
substantially  validate  our  more  intuitively  derived 
computing  cost  formulas  for  the  wide  range  of  practically 
sized  problems.  Ue  will  find  that  the  actual  computing 
times  can  usually  be  predicted  to  within  15%  accuracies  in 

. 


i 
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most  empirical  test  cases  when  the  intuitive  cost  estimation 
formula  (E7)  is  used.  Such  accuracies  are  quite  good 
considering  the  wide  variety  of  bas'c  operations  underlying 
such  an  algorithm  for  the  computation  of  polynomial  GCD’s. 

The  three  algorithms  tested  are  implemented  (hopefully) 
uithoui  any  bias  because  of  their  independent  designs  and 
codings  in  the  symbolic  manipulation  system  HACSYHA  IMAC73J 
and  run  on  a PDP-10  computer  with  a 2 microsecond  memory. 

The  particular  examples  are  chosen  to  demonstrate  specific 
features.  For  the  most  part  they  are  chosen  for  simplicity 
and  clarity,  especially  u.r.t.  the  integral  coefficients, 
and  are  theruise  (hopefully)  quite  random.  Note  that 
timings  for  the  Reduced  GCO  Algorithm  are  included  here 
mainly  for  the  purpose  of  demonstrating  the  combe t i t i veness 
of  the  PRS  type  of  algorithms  in  small  and  certain  sparse 
cases.  The  general  blowup  behavior  of  the  PRS  GCO 
algorithms,  even  the  most  efficient  subresultant  PRS  GCO 
algorithm,  for  large  and  dense  problems  is  quite  well  known 
and  expected.  Such  a phenomenon  will  also  be  revealed  by 
the  data  below.  Our  main  purpose  for  this  empirical  data, 
however,  is  to  compare  the  performances  of  the  Modular  and 
EZGCO  Algorithms  and  to  support  the  analytical  computing 
time  predictions  with  some  actual  figures.  Numbers  in 
parentheses  indicate  the  number  of  "garbage  collections", 
which  cost  aproximately  3 seconds  each,  made  during  the  GCO 


computation.  This  number  gives  a fairly  good  indication  of 
the  storage  space  consumed  by  intermediate  expressions. 

Case  Is  gcd(F,  G)  » 1; 

v v 

F - (x  * II  (x  *£  y + 2)  s 

i »1  i »1 


2 2 
G - (x  +7  U 


2 2 2 

/ y + 1)  (-3  y x + y - 1) . 
* i 1 1 


EZ. 

0.307 

0.457 

0.837 

0.855 

1.092 

REO. 

0.5G8 

2.544 

10.55 

44.02(2) 

147.5(10) 

MOO. 

0.213 

0.390 

0.601 

0.849 

1.144 

This  GCO  » 1 case  is  supposedly  best  suited  for  any  modular 
algorithm.  Since  the  Modular  and  the  EZGCD  Algorithms  are 
based  on  modular  arithmetic,  they  both  perform  well  as 
expected. 


Case  2:  Linearly  dense  quartics  with  quadratic  GCO’s. 
v 

D » (x  +^T\  + 1)  ; 

i-1 


■ 1 11 1111111  1 1 


P(.UJ  ,i  mv.  UU...A..I. 
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F - 0 (x 


i-i 


- D (x  i 


G - D (x  +"\  y +•  2)  . 
!«1 


V 

- 1 

2 

3 

4 

5 

EZ. 

1.320 

2.520 

5.124 

9.475 

19.75(1) 

RED. 

0.897 

3.3'  7 

10.85 

34.34(1) 

— 

MOD. 

1.194 

G.071 

33.98(1) 

180.4(7) 

— 

This  is  a dense  manic  case  where  both  F and  G have  degree  4 
in  x,  the  main  variable,  and  their  GCD,  D,  is  a quadratic 
polynomial.  For  the  Reduced  GCD  Algorithm,  the  PRS 
sequences  are  normal  and  involves  only  tuo  pseudo-division 
steps.  For  the  Modular  GCD  Algorithm,  ue  have  degree  d - 4 
in  each  variable,  so  that  according  to  the  formula  of 

v+1 


computing  time  (d  + 1)  , the  ratio  of  computing  times  for 

our  polynomials  above  should  be  d + 1 « 5 and  ue  see  that 
the  ratios  between  successive  pairs  of  the  actual  computing 
times  (less  approximately  three  seconds  for  each  garbage 
collection)  approximate  the  prediction  remarkably  well.  For 
the  EZGCD  Algorithm  ue  uill  also  perform  this  "ratio  test". 
The  ratios  betueen  succesive  pairs  of  actual  computing  times 
are  roughly  RA(2,  1;  - 1.9,  RAO,  2)  - 2.0,  RA(4,  3)  - 1.8, 
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and  RA (5,  4)  ■ 1.7,  rounding  to  two  significant  digits. 

Since  the  given  polynomials  are  wonic  and  dense  uhile  the 
number  of  terms  are  still  reasonably  small,  formula  (E7) 
would  apply  in  this  case  as  simply  T (U) . Here,  F of  the  two 
original  polynomials  F and  G,  since  their  degrees  are  the 
same  in  x,  is  used  for  the  Hensel  construction.  The  number 
of  terms  are  T(U)  - T(F)  - 13,  29,  S7,  102,  170.  Thus 
ratios  of  theoretically  predicted  costs  are  ratios  of 
successive  T (U)  which  are  RT (2,  1)  - 2.2,  RT (3,  2)  * 2.0, 
RT(4,  3)  » 1.8,  and  RT (5,  4)  * 1.7,  also  rounding  to  two 
digits.  As  we  can  see,  this  also  agrees  remarkchly  well 
with  the  ratior  of  actual  computing  times.  Except  possibly 
the  firoi  ratio,  which  is  due  to  the  high  overhead  (total 
preparation  costs,  including  computing  contents  and 
substitutions)  which  is  unusually  high  for  the  first  small 
example.  This  phenomenon  will  also  occur  for  later  cases. 
But,  in  each  case,  as  the  problem  size  increases  the  ratios 
tend  to  agree  better  and  better.  This  is  in  fact  the  reason 
for  making  these  ratio  tests  - as  the  problem  sizes  increase 
in  our  "controlled  experimental"  environments  the  relatively 
small  overhead  costs  become  dominated  and  the  total  cost  of 
the  algorithm  corresponds  more  and  more  to  the  number  of 
terms  of  the  polynomials  involved  . This  gives  strong 
support  to  our  heur i st ical ly  derived  estimate  of  computing 
costs  for  the  EZGCO  Algorithm  as  well  as  Generalized  Hensel 


This  is  again  a monic  case,  but  nou  the  polynomials  F and  G 
are  quite  sparse.  Thus  formula  (E7)  applies  as  T(D)  T(H). 

T (□)  =>  T(H)  = 3,  +,  5,  G,  7,  8.  Thus  the  ratios  of 
theoretically  predicted  costs  are  approximately  RT (2.  !.)  » 
15/9  = 1.8,  RT (3.  2)  » 25/15  ■ 1.55,  RT(4,  3)  - 36/25  - 
1.44,  RT (5,  4)  - 49/36  - 1.36,  and  RT (5,  5)  - 64/49  - 1.38. 
Compared  with  ratios  of  actual  computing  times  RA (2 , 1)  » 


1.4,  RAO,  2)  - 1.50,  RA(4,  3)  - 1.42,  RAO,  4)  - 1.3G,  and 
RAO,  5)  - 1.30,  we  again  have  very  good  agreements.  Also 
in  this  case  the  Reduced  GCD  Algorithm  performs  quite  well, 
because  of  the  sparseness  of  the  problems  and  the  symmetric 
regularity  of  the  given  polynomials.  But  the  next  table 
shows  that  slight  variations  of  the  input  polynomials  could 
cause  the  timing  of  the  Reduced  Algorithm  to.  change 
drastically  uhere  as  the  EZGCD  Algorithm  remains  quite 
stable. 

Case  31:  0 and  F as  above; 


G - 0 <x  + 


d-; 


V 

- 1 

2 

3 

EZ. 

0.S20 

1.112 

1.757 

RED. 

0.43G 

G.  141 

134.9(5) 

noo. 

0.721 

7.88G 
1 

158.9(9) 

Since 

G is  of 

O 1 

-j 

degree  in 

is  used  for  Hensel  construction  according  to  the  heuristics 
set  in  EZGCD  Algorithm.  Here  G is  nearly  completely  sparse 
so  that  the  formula  T (D)  T (H)  still  applies  and  the  RT 
ratios  are  the  same  as  above.  Compared  with  ratios  of 
actual  computing  times  for  the  data  of  this  table 


RA(2,  1)  - 1.8,  RAO.  2)  - 1.58,  RA<4,  3)  - 1.48. 

RA(5,  4)  - 1.38,  and  RA (6,  5)  ■ 1.31,  we  still  have  fairly 
good  agreements. 


Case  4:  Quadratic  non-monic  GCO  with  quadratic  cofactors. 


2 2 ^ 2 
D - y x +\  y + Is 

1 L-J  I 

i-2 


2 2 \ 2 
F - D (x  - y + y y - 1); 

1 L-J  i 

i-2 


G - 0 (y 


.-♦E 


y + 2)  . 


i-2 


V 

- l 

2 

3 

4 

5 

EZ. 

1.184 

1.612 

2.506 

3.822 

5.549 

RED. 

0.G97 

6.480 

55.58(1) 

407.6(20) 

— 

mo. 

1.487 

6.754 

39.15(2) 

190.9(10) 

— 

All  three  previous  cases  have  been  with  monic  polynomials 
which  not  only  avoid  the  leading  coefficient,  problem  but 
also  the  non-zero  substitution  problem  for  the  Hensel  con- 
struction in  the  EZGCD  Algorithm.  So  we  now  test  a more 
complex  situation  for  the  EZGCD  Algorithm.  Here  we  must  use 


i 


i 


r 


(E7)  as  T ( sub ( (d I "/dl ) 0)  T(sub(dl  H))  with  U - F,  dl"  - y , 

1 


and  dl  ■ y . Since  I c (F)  ■ y , the  evaluation  value  for  y 
1 1 1 

can  not  be  0,  so  that  there  will  bo  some  blouup  in  the 

number  of  terms  when  ue  substitute  y + b for  y . In  this 

1 1 1 

case  T ( sub ( (d I " /d I ) 0)  - 4,  5,  S,  7,  8,  since  d I "/d I ■ 1 and 


each  y x in  D becomes  three  terms  after  substitution. 

1 

T(sub(dl  H) ) » 8,  11,  14,  17,  20.  Note  that,  in  counting 

the  terms  above,  ue  have  implicitly  assumed  that  all  other 

evaluation  values  except  b for  y are  zeros.  ThuB  the 

1 1 

corresponding  ratios  of  theoretically  predicted  costs  are 
RT (2.  1)  - 55/32  - 1.73,  RT'3,  2)  - 84/55  - 1.53, 

RT‘4.  3)  . 119/84  - 1.42,  and  RT (5,  4)  - 160/119  - 1.35. 
Compared  uith  the  ratios  of  actual  computing  times 
RA (2,1)  - 1.36,  RAO,  2)  - 1.56,  RAW,  3)  - 1.50,  and 
RA(5,  4)  ■ 1.45,  the  agreements  are  still  reasonably  good 
considering  the  many  uncertainties  in  this  case.  Tuo  spe- 
cifically important  uncertainties  are  due  to  the  fact  that 
the  total  number  of  terms  involved  is  still  quite  small  so 
the  overhead  costs  are  not  completely  dominated  and  the 
fact  that  this  is  a case  intermediate  betueen  dense  and 


1 


1.032 

3.712 

IS.  77 

0.845 

13.47 

525.6(45) 

0.814 

4.258 

22.85(1) 

Th i 9 i9  a completely  dense  case  for  which  the  Modular  GCO 


I 
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Algorithm  is  at  its  best.  Since  dl"  - gcd(lc(F),  lc(G)) 
v 

■ lc(0)  ■ dl  ■ (y  +1)  and  (dl"  F),  (dl"  G)  are  U9ed 
1 i 
i-1 

for  the  Modular  Algorithm,  the  computing  time  formula  for 

v 

this  case  should  be  (d  + 1)  (2  d + 1 with  d - 2.  Thus  the 
predicted  ratio  of  computing  times  for  Modular  GCD  Algorithm 
is  5.  Ue  see  that  this  ratio  is  approximately  observed  in 
the  actual  computing  times  of  the  Modular  Algorithm.  Next, 
we  note  that  even  though  d ■ 2 for  all  variables  in  F and  G, 
and  D is  linear  and  dense  so  that  the  PRS  sequences  consist 
of  results  for  only  one  pseudo-division,  the  Reduced  GCD 
Algorithm  stili  loses  badly  to  the  other  two  algorithms.  As 
for  the  EZGCD  Algorithm,  this  cace  represents  a difficult 
extreme  uhere  there  are  not  only  leading  coefficient  blouup 
problems  but  also  dense  leading  coefficients  in  dense 
polynomials.  The  actual  computing  times,  however,  tuned 
out  to  be  good  compared  with  the  Modular  time9.  The  ratios 
of  these  times  are  RA (2,  1)  - 3.G,  RAO,  2)  - 4.5, 

RA(4,  3)  « 5.0.  On  the  other  hand,  since  the  polynomials 
are  dense,  substitutions  do  not  change  the  number  of  terms, 
so  that  we  can  use  formula  (E7)  for  the  co9t  of  the  EZGCD 
Algorithm  with  cost  - T(d!"  U)  where  U - F because 
deg(F)  - deg(G).  Thus,  similar  to  the  Modular  Algorithm,  we 


have  also  T (d I " U)  - 3*5**v.  According  to  this,  the  ratio 
of  theoretically  predicted  costs  is  5 for  all  pairs  of  suc- 
cessive problems,  indeed,  we  see  the  actual  ratios  approach 
this  number  5 as  v gets  large,  or  as  the  problem  becomes 
sufficiently  large  ?o  that  the  overhead  costs  are  dominated. 


Case  51:  Sparse  non-monic  quadratic  polynomials  with  linear 

CCD’s 


v 


v 

- 1 

2 

3 

4 

EZ. 

0.644 

1.436 

4.605 

26.6911) 

RED. 

0.164 

0.245 

0.337 

0.431 

noo. 

0.710 

— 1 '1  ■ w 

3.771 

22.41(1) 

113.0(5) 

This  case  is  only  slightly  different  from  Case  5 as  we  can 
easily  see,  except  that  tne  polynomials  are  very  sparse 
here.  In  fact,  each  of  F and  G has  only  three  terms.  For 
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such  sparse  polynomials  with  low  degrees  (2)  the  number  of 
PRS  elements  is  very  small  for  the  Reduced  GCD  Algorithm. 
Actually,  for  each  problem,  the  PRS  is  normal  anr  requires 
only  one  pseudo-division,  so  that  there  can  not  be  much 
growth  at  all.  That  accounts  for  the  relatively  fast  com- 
puting times  for  the  Reduced  algorithm  in  the  above  table. 

For  the  Modular  Algorithm,  the  timings  are  surprisingly 
similar  to  the  dense  problems  of  Cas?  S.  The  common  ratio 
for  predicted  times  and  for  the  actual  computing  times 
remains  5,  which  clearly  demonstrates  the  insensitivity 
of  the  modular  algorithm  to  the  sparseness  of  the  polynomials 
and  the  computing  time  formula  derived  for  this  algorithm 
indicates  this  "worst-case"  behavior.  It  shows  that 
the  numbr-  of  variables  in  the  problem  and  their  maximum 
degrees  are  the  only  parameters  affecting  the  computing 
times  for  the  Modular  GCD  Algorithm.  These  are  not 
yet  the  main  points  to  be  made  with  the  empirical  data  of 
this  case.  The  major  observation  to  oe  made  here  is  that 
this  is  a bad  case  for  the  EZGCO  Algorithm  also.  Since  the 
polynomials  in  x have  products  of  pouers  of  all  other  vari- 
ables as  leading  coefficients,  the  validity  requirement  for 
the  evaluation  value?  forces  them  to  be  non-zero.  Thus, 

after  the  substitution  transformations  (y'  + b for  y ) are 

i i i 

performed  on  the  polynomial  used  for  Hensel  construction 


ki&uiij 


195 

(in  this  case,  it  does  not  matter  if  it  is  F or  G),  the  neu 
polynomial  that  the  Multivariate  EZ  Algorithm  has  to  uork 
with  is  just  as  dense  as  the  problems  of  Case  5.  In  fact, 
the  number  of  terms  ui 1 1 be  exactly  the  same  with  only  dif- 
ferent numerical  coefficients  (due  to  differences  betueen 

various  b 's  of  Case  5*  and  2 of  Case  5).  So,  T(eub(dln  „')) 

i 

for  this  case  is  the  same  as  T (d I " U)  of  Case  5 uhich  is 
3*5**v.  Accordingly  tho  ratio  of  predicted  costs  for  pairs 
of  successive  problems  it  also  5.  Indeed,  we  see  that 
RA(2,  1)  = 2.2,  RAO,  2)  ■ 3.2,  and  RA(4,  3)  • 5.1  uhich  ie 
approaching  5 as  the  problem  get  large.  Note  that  the 
actual  computing  times  of  this  case  are  proportionally  lees 
than  those  of  the  dense  problems  and  those  of  the  Modular 
Algorithm,  because  of  the  fact  that  the  blowup  due  to  non- 
zero substitutions  only  occurs  ;n  the  step  of  EZGCO  Algo 
rithm  where  the  Hensel  construction  is  applied  and  not  in 
other  steps.  Therefore,  we  emphasize  the  point  that  uhen 
the  validity  requirement  of  the  evaluation  values  force  them 
to  be  non-zero  and  cause  the  non-zero  substitution  problem, 
the  computing  time  unfortunately  goes  up  close  to  that  of 
more  dense  problems.  The  phenomenon  is  quite  similar  to 
that  of  the  Modular  GCO  Algorithm,  but  these  bad  cases  for 
the  EZGCO  Algorithm  occur  much  less  frequently  (since  they 
depend  on  the  number  of  non-zero  values  for  valid  and  lucky 
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evaluations)  than  the  bad  casec  for  Modular  Algorithm  uhich 
depends  solely  on  the  number  of  variables  and  their  degrees 
in  the  given  polynomials. 

Case  6:  Trivariate  Polynomials  uith  increasing  degrees 

j 

0 - x y (z  - 1 ) ; 

j j+1  j 

F - D (x  + y z +1): 
j+1  j j+1 

^*D(x  +y  z -7). 


j 

- 1 

2 

3 

4 

5 

EZ. 

1.132 

1.523 

1.334 

2.443 

3.067 

RED. 

0.836 

1.750 

2.637 

3.884 

5.321 

MOO. 

5.203 

15.24(1) 

31.42(3) 

51.88(5) 

82.85(3) 

For  all  previous  cases,  ue  have  used  the  number  o' 
variables,  v,  as  the  major  parameter  for  experimentations. 

Ue  observed  that  in  those  cases,  the  Modular  as  uell  as  the 
Reduced  GCD  Algorithms  are  extremely  sensitive  to  the 
increase  of  v.  The  EZGCD  Algorithm  depends  more  on  the 
number  of  terms  it  has  to  process,  rather  than  on  v alone, 
so  it  is  more  efficient  in  the  sparse  polynomial  situations. 
Ue  also  sau  the  predictabilities  of  the  actual  computing 
times  uith  some  of  our  formulas,  especially  for  the  Modular 
and  the  EZGCO  Algorithms. 

In  this  case,  ue  concentrate  on  testing  another 
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important  parameter  - the  degree  of  the  polynomials.  Our 
purpose  is  also  to  see  the  performance  of  the  EZGCO 
Algorithm  for  sparse  polynomials  but  having  a leading 
coefficient  blowup  problem  that  worsens  with  respect  to  the 
degree.  First  of  all,  the  numbers  in  the  above  table  reveal 
the  fact  that  the  EZGCO  Algorithm  still  performs  well  even 
under  these  adverse  conditions.  Looking  at  the  polynomial 
inputs  in  more  detail,  we  have  dl"  - dl  • y z.  This  forces 
the  evaluations  to  be  non-zero.  In  addition,  the  cofactors 
contain  high  powers  of  y and  Z,  SO  that  the  blowup  due  to 
non-zero  substitutions  in  Hensel  constructions  are  quite 
severe.  In  fact,  we  get  dense  polynomials  in  y and  z.  Thus 
we  use  formula  (E7)  with  proportional  cost  ■ T(sub(dl"  U) ) 
corresponding  to  dense  polynomials.  U • F,  since 
deg(F)  < deg(G).  If  we  count  carefully,  T ( sub (d I ” U) ) ■ 41, 
59,  81,  107,  and  137.  From  those,  we  get  RT(2,  1)  - 53/41 
« 1.44,  RT (3,  2)  - 81/59  - 1.37,  RT(4,  3)  - 107/81  - 1.32, 
and  RT (5,  4)  - 137/107  - 1.28.  Compared  with  ratios  of 
actual  computing  times  RA (2 , 1)  - 1.28,  RA (3,  2)  - 1.27, 
RA(4,  3)  - 1.28,  and  RA(5,  4)  - 1.25,  the  agreements  are 
quite  adequate.  For  the  Modular  GCD  Algorithm,  we  should 
use  dl"  F and  dl"  G.  The  degrees  of  dl"  F in  x,  y,  and  z 
respectively  are  2j,  j + 3,  and  j + 2.  Using  thsee  degrees 
in  the  standard  formula  tor  the  Modular  algorithm,  we  find 
that  the  computing  time  should  be  growing  someuhat  as 

l illiMiiM  III-  H II  1*1.1  .iiftiiii'im  1-rii^lffliilMhalli— I MIN 
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(2  j + 1)  ( j + 4)  (j  + 3).  Thus 
RT (2,  1)  > (5  G 5 ) / (3  5 4)  - 2.50, 

RT (3,2)  - (7  7 6) / (5  G 5)  - i.96, 

RT (4,  3)  » (9  8 7) / (7  7 G)  - 1.71,  and 
RT (5,  4)  * (11  9 8) / (9  8 7)  ■ 1.57.  Compared  with  ratios  of 
actual  computing  times  RA (2,  1)  - 2.35,  RA(3,  2)  - 1.83, 
RA(4,  3)  = 1.G5,  and  RA (5,  4)  ■ 1.51,  the  agreements  are 
quite  good  considering  the  fact  that  polynomials  are 
actually  sparse  rather  than  dense  as  the  formula  expresses. 
Finally  ue  also  point  out  that  the  PRS  sequences  for  the 
Reduced  GCO  Algorithm  are  normal  in  this  case.  That 
partially  explains  the  good  performance  of  the  Reduced 
Algor i thm. 

Case  7:  GCO’s  of  trivariate  polynomials  requiring  special 

case  method  of  EZGCD  Algorithm 

P-x-yz  + 1;  Q ■ x - y + 3 z; 

j k k j 

F ■ P Q ; and  G ■ P Q . 


j.  * 

- 1.2 

1,3 

i 

1.4 

2,4 

EZ. 

2.5G8 

3.914 

5.778 

_f 

10.80 

1.354 

4.2G3 

8.492 

i 

18.39(1) 

33.58(2) 

This  is  a very  simple  trivariate  case  where  the  special  case 
method  of  the  EZGCD  Algorithm  has  to  be  U9ed.  Ue  employ 
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this  case  to  shou  that  even  though  the  special  case  method 
is  much  more  inefficient  than  the  regular  algorithm  ( (Al)  - 
(A9)),  it  is  in  fact  not  more  inefficient  than  the  other  GCO 
algorithms  in  many  cases  similar  to  the  or.e  tested  here. 
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CHAPTER  V.  Polynomial  Factorization 


V - 1 Introduction 

In  this  chapter,  we  will  discuss  methods  for  factoring 

polynomials  of  one  and  several  variables  over  the  integers. 

The  firet  computationally  feasible  method  for  the 

factorization  of  univariate  polynomials  over  Z was  due  to 

Kronecker.  As  presented  by  Van  der  Uaerden  [VDU49,  Section 

25],  the  method  is  essentially  as  follows; 

For  a given  polynomial  P(x)  of  degree  n,  let  m be  the 

greatest  integer  < n/2.  For  m + 1 integral  values 

a , a , , a ; compute  P (a  ) , ...  , Pla  ) and  let 

0 1m  0 m 

5 - the  9et  of  all  distinct  integral  factors  of  P(a  ). 


For  k - 2,...„m+l  do  the  following.  Choose  set  of  k ele- 
ments b , one  from  each  S , and  use  interpolation  to  find  a 


polynomial  Q(x)  of  degree  k-1  such  that  G(a  ) - b for  all 

I i 

0 < i < k.  If  Q(x)  divides  P(x)  then  we  have  found  a 
factor  of  P and  we  can  recursively  apply  this  method  on 
P(x)/Q(x).  Otherwise  choose  another  set  of  k b 's  from  the 


S ’s  different  from  all  previously  chosen  sets,  interpolate, 


and  test-divide  again.  Unen  all  possible  vombinat ions  of 


Preceding  pap  blank 


m + 1 or  feuer  integral  values  front  the  S 's  have  been 

i 

exhausted  (the  step  for  k * m + 1 is  done),  we  conclude  that 
P is  irreducible. 

The  reason  for  the  lack  of  success  with  Ahe  Kronecker’s 
method  even  with  today's  high  speed  computers  is  the  trial— 
and-error  nature  and  the  exponential  number  (w.r.t.  rt)  of 
possible  factors  needed  to  be  tried.  It  is  alsj  a nontri- 
vial task  to  factor  integers;  in  fact,  its  cost  is  an  expo- 
nential function  of  the  length  of  the  given  integer.  These 
exponential  growths  dependent  upon  the  degree  and  the  inte- 
gral length  of  the  coefficients  of  the  given  polynomial  pre- 
vent this  method  from  being  a practical  computational  algo- 
rithm even  after  several  improvements  were  suggested  [V0U491 
[J0HGG1 . 

Berlekamp’s  factorization  algorithms  for  polynomials 
over  finite  fields  IBERG7]  IBER70]  proved  to  be  fundamental 
for  developing  efficient  algorithms  for  factoring 
polynomials  over  the  integers.  Based  on  Berlekamp’s 
algorithm  Knuth  (KNU63,  Section  4,6.21  suggested  a method  of 
reconstructing  the  factors  over  Z using  the  Chinese 
Remainder  Algorithm.  However,  the  interpolative  nature  of 
the  Chinese  Remainder  Algorithm  (Garner’s  Rule)  retains  the 
exponential  growth  uith  respect  to  the  degree  of  the  given 
polynomial.  Although  the  exponential  growth  due  to 
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factoring  integers  is  completely  avoided  in  this  method,  the 
number  of  tentative  factors  and  their  combinatorial 
possibilities  is  still  potentially  large. 

It  uas  Zassenhaus’  suggestion  of  using  Hansel's  p-adic 
construction  (Lemma  1 1-3.1)  and  his  quadratic  improvement 
thereof  (Lemma  1 1-3.2)  that  initiated  new  research  for  more 
efficient  algorithms  in  polynomial  factorization  based  on 
mod  p factorizations.  In  his  thesis,  husser  IHUS71 J 
presents  abstract  algorithms  for  factoring  univariate  and 
multivariate  polynomials  which  theoreticaly  establish  the 
feasibility  of  using  Hensel-type  constructions  for 
factorization  in  general  algebraic  spaces.  He  also  presents 
detailed  specif icati  s,  careful  discussions,  asymptotic 
computing  time  bounds,  and  some  empirial  timings  for  the 
algorithm  for  factoring  univariate  polynomials  over  the 
integers  based  mainly  on  Berlekamp's  and  Zassenhaus’ 
algor i thms. 

For  multivariate  polynomial  factorization,  flusser’s 
abstract  algorithm  has  definite  computational  drawbacks,  as 
pointed  out  previously.  So  our  presentation  of 
factorization  algorithm  in  this  case  will  essentially  be 
based  on  that  discussed  by  Uang  and  Rothschild  IU&R73J  which 
uses  the  computationally  efficient  Generalized  Hensel 
Algorithm  of  Section  11-4.  In  addition,  we  attempt  to 
analyze  the  computing  cost  of  the  multivariate  factorization 
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algorithm  which  has  hitherto  been  lacking  in  the  literature. 


V - 2 Factorization  of  Univariate  Polynomials 

The  algorithm  for  factoring  an  univariate  polynomial 
F(x)  over  Z can  be  divided  into  six  basic  steps  as  follows: 

(1)  Set  c < — cont(F)  and  F <—  pp (F) . 

k 


(2)  Find  the  square-free  decomposition  of  F,  | | F . 

i 

1. 


I 


Each  F will  now  be  square-free  and  primitive. 


For  such  a polynomial  G we  factor  by 

(3)  Find  3 prime  p in  Z such  that  g (x)  <—  G(x)  (mod  p) 
has  the  same  degree  as  G and  is  square-free  in  (Z/p)  1x3 . 
(The  existence  of  such  a prime  can  be  shoun  quite  easily. 
For  reference,  see  [W&R73).) 


(4)  Factor  g(x)  into  o (x)  g (x)  ...  g (x)  in  (Z/p)  tx] 

1 2 r 


by  applying  the  Berlekatnp  s algorithm  over  finite  fields. 
(Doth  Musser  [MUS711  and  Uang  and  Rothschild 
[UISR731  have  detailed  discussions  and  algorithmic  imple- 
mentations of  this  algorithm  for  'small"  prime  fields.)  If 
r ■ 1,  G is  irreducible  over  Z. 

(5)  For  r / 1,  apply  Zassenhaus’  algorithm  (Algorithm 
1 1-3.2)  successively  on  complementary  products  of  these 


g *3  to  obtain  G (x),  G (x),  ...  , G (x)  such  that 
i 12  r 

G - G G ...  G (mod  q)  where  q ■ p**(2**m)  which 
1 2 r 

bounds  any  integral  coefficient  of  any  factor  of  G.  (See 

discussion  in  proof  of  Theorem  1 1 -5. 3.) 

Also  G » y (mod  p)  for  all  1 < I < r. 
i I 

(G)  Combine  extraneous  factors  and  restore  leading  coeffi- 
cient for  these  G * s and  get  the  true  factors  of  G over  Z. 

i 

Both  Algorithm  2.GP  of  IMUS71]  and  Algorithm  TRUFACTORS  of 
(I4&R731  discuss  this  step  in  detail. 

flusser  gives  a very  complete  analysis  of  asymptotic 
computing  time  bounds  for  this  univariate  case  in  his  thesis 
[F1US71],  Ue  refer  the  reader  to  flusser* s thesis  for 
details.  However,  at  this  point  we  will  call  the  reader’s 
attention  to  the  problem  of  combinatorial  ly  collecting 
extraneous  factors  into  true  factors.  For  an  irreducible 
polynomial  uith  k extraneous  factors,  as  many  as  2**k 
combinations  of  factors  and  divisions  may  be  required  before 
the  irreducibi  I i ty  of  the  polynomial  can  be  discovered. 

V - 3 Multiple  Factor  Generalized  Hensel  Construction 
Because  of  the  fact  that  a polynomial  may  have  many 
factors,  we  will  present  a new  version  of  the  Generalized 
Hensel  Algorithm  of  Section  1 1-4  which  will  construct  more 
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than  two  multivariate  factors  from  their  evaluated 

univariate  representations.  This  algorithm  will  be 

equivalent  to  GHA  when  the  numbei  of  factors  is  two.  Also, 

as  we  will  see  later,  the  computing  cost  of  this  algorithm 

will  essentially  be  l/(r  - 1)  times  less  than  that  of 

applying  GHA  r - 1 times  on  successive  groupings  of  the  r 

factors  as  the  way  this  is  done  in  [U&R731  and  [HUS71]. 

Since  we  believe  that  the  savings  in  cost  for  the  univariate 

case  will  not  be  as  significant  as  the  multivariate  case,  we 

will  only  discuss  this  sem  i -para  1 1 e I multiple  factor 

construction  for  the  multivariate  case.  Houever,  we  make 

the  claim  that  a similar  method  applies  to  the  multiple 

factor  univariate  constructions,  and  we  challenge  the  reader 

to  work  that  out  in  detail. 

Theorem  V-3.1:  (dultiple  Factor  Generalized  Hensel) 

Let  F(x,y  ,...,y  ) be  a multivariate  polynomial  in 
1 v 

(Z ty  .....y  ] ) tx] . Let  b » (b  ,...,b  ) be  a given  set  of 
1 v 1 v 

integral  values  and  S - (y  -b  ,...,y  -b  I such  that 

11  v v 

I c (F ) £ 0 (mod  q,  S)  where  q is  a given  rth  power  of  a lucky 

prime  p for  F . Assume  there  exist  pairwise  relatively  prime 
b 

univariate  polynomials  FI  (x),  F2  (x),  ...  , Fr  ix)  in 

1 1 1 

(Z/q) [xj  » ( J/S)  [x] , uhere  J » TZ/q) Ey  ,...,y  J,  such  that 

1 v 


20' 


F - FI  F2  ...  Fr  (mod  q,  S).  Then,  for  any  k > 1,  we 
111 

k 

can  step  by  step  construct  FI  , ...  , Fr  in  (J/S  ) [x) 

k k 

k 

simultaneously  such  that  F - FI  F2  ...  Fr  (mod  q,  S ) 

k k k 

and  Fi  ■ FI  (mod  q,  S)  for  all  1 < I < r. 
k 1 


Proof:  Let  F[j|r]  denote  Fj  Ftj+1]  ...  Fr  for 

1 111 

j ■ 2 Since  I c (F)  £ 0 (mod  q,  S) , it  must  be  a 

unit  in  Z/q,  hence  so  must  lc(Fi)  be,  for  i»l r.  By 

Lemma  and  Corollary  1 1 -2.1,  we  can  use  Algorithm  1 1 -2.1  to 

find  Ai  (x),  Bi  (x)  in  (J/S) [x]  such  that 
1 1 


Ai  Fi  + Bi  F(i|r]  -1  anu  deg CA i ) < deg  F(i|r]  , 

1111  1 1 

deg (B i ) < deg (F i ),  for  all  i-l,...,r-l.  From  these  poly- 

1 1 

nomials  we  uill  construct,  by  induction,  sequences  of  polyno- 
mial s iFl  i , IF2  I , ...  , IFr  I such  that  F ■ FI  . . . Fr  in 
mm  m mm 


(J/S  ) [>:]  and  Fi  - Fi  (mod  q,  S)  for  all  i«l 
m 1 


(Again  let  F I j |r]  denote  Fj  F ( j+1]  ...  Fr  for  j-2,...,r.) 

nt  m in  in 

For  inductive  hypothesis,  assume  for  m > 1 we  have 


Fi  (x,y  i,  in  (J/S  ) tx]  such  that 

ml  v 

F - FI  F2  . . . Fr  . Let 


R1  (x,y  ,...,y  ) - F - FI  (F2  ...Fr  ) (mod  q,  S ) 
m l v m m m 


. \ \l  (x)  Ml  (y ). 

/ ^ Im  ml  v 


For  each  typical  coeff  icient  polynomial.  Cl  (x) , of  R1  , we 

Im  m 

apply  Algorithm  1 1-2.2  on  FI  , Ft2|r]  , A1  , D1  , and  Cl 

1111  Im 

in  (Z/q)  [x]  and  obtain  A1  , B1  such  that 

Im  Im 

A1  FI  + B1  F I2|rl  -Cl  and 
Im  1 Im  1 Im 

deg(Al  ) < deg(F(2|r]  ) in  (Z/q) txl . Now  let 
Im  1 


■£ 


B1  (x)  Ml  (y  ,...,y  ), 
Im  ml  v 


F(2|rl 


. \ ^A1  (x)  111  (y  ,...,y  ),  FI  - FI  + FI  , 

/ J I m ml  v m+1  m 


and  F(2|rl  -F(2|r]  + F(2|r]  . Then 

m+1  m 

(m)  (m) 

FI  F(2|r]  -FI  FI2|r]  + FI  FI2|rl  + F(2|rl  FI 
m+1  m+1  mm  1 1 


wm.u  f » 1 


FI  F2  . . . Fr  + Hi  -Fin  J/S  . Ue  observe  that 


m m 


n J/S  R2  - F/Fl  - F2  ...  Fr  - Ft2|r]  - Ft2|r] 
m m+1  m B *+l  1 


, F [2|r]  . In  general  (or  inductively),  assume  ue  have 

m+1 

• - FI  F2  ...  Ftj-11  Ft j |r)  in  J/S  . then 
m+1  m+1  m+1  m+1 

Ij  - F/Fl  /.../Ftj-11  -Ftj|rl  - Ftj|rl  -Ftj|rl 
m m+1  m+1  m m+1 


■ Ft  j |r)  Cj  (x)  tlj  ty  ,...,y  ).  For  each  Cj  (x 

Im  m 1 v Im 

Im 

ue  can  apply  Algorithm  11-2.2  on  Fj  , Fl(j+l)|r]  , Aj  , Bj 

1 11 

and  Cj  to  get  Aj  and  Bj  in  (Z/q)  txl  such  that 
Im  Im  lm 

Aj  Fj  + Bj  F t( j+1)  |rl  - Cj  . Then  let 
lm  1 tm  1 Im 


EBj  (x)  flj  (y  ,...,y  ), 
Im  ml  v 


Ft(  j+1)  | rl 


■\  ^ Aj  (1j  (y ). 

/ j Im  m 1 v 


Fj  - F j + Fj  , and 
m+1  m 


F t( j+1? 1 r ] -Ft(j+l)|rl  + Ftt j+1) |rl  , ue  have 


F - FI  ...  Fj  F(( j+1) |r]  in  J/S  and 
m+1  m+1  m+1 


Rtj+11  ■ Ft(j+l)|r]  . Thus  ue  can  >jet  F - '-'1  ...  Fr 

m . m+1  m+1 


in  J/S  by  making  r - 1 such  iterations  where  for  all 

(1)  (m) 

m > 1 we  have  Fj  ■ Fj  + Fj  + ...  + Fj  for 
m+1  1 

j - 1 ..... r which  exactly  corresponds  to  the  two-factcr  case 
of  GHA.  This  completes  the  induction  and  the  proof.  // 
Ue  will  not  specify  the  algorithm  for  this  multiple 
factor  generalized  Hensel  construction  in  detail  since  the 
proof  of  the  theorem  is  constructive.  Ue  will,  however, 
point  out  that  all  of  thr  special  preparatory  computational 
considerations  discucsed  in  Section  1 1 —4  still  can  be 
applied  here.  Thus  this  algorithm  is  a direct  extension  of 
GHA  (Algorithm  II-4.1).  Referring  to  the  analysis  of  the 
computing  cost  of  GHA  in  Section  IV-2,  ue  3ee  that  because 
of  the  simultaneous  construction  of  all  factors  uhich 
results  in  savings  of  the  multiplication  costs  (corres- 
ponding to  Step  (5)  of  GHA).  the  corresponding  total  com- 
puting cost  for  this  algorithm  is  approximately 


(n/v)  ( T (F)  + 


2>.’ 
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Ue  a 1 30  point  out  that  the  total  space  requirement  for 
storing  preparatory  data  and  intermediate  results  of  this 
simultaneous  construction  is  essentially  the  same  as  that  of 
applying  GHA  successively  r - 1 times.  But,  if  C represents 
Hu?  cost  of  applying  GHA  once,  then  this  simultaneous  con- 
struction effectively  improves  the  cost  from  (r  - 1)  C to  C 
which  can  be  most  significant  if  the  number  of  factors  of  F, 
r,  is  large. 

V - 4 Factorization  of  Multivariate  Polynomials 

The  only  known  implementation  of  the  Hensel  type  multi- 
variate factorization  algorithm  is  by  Uang  and  Rothschild  on 
the  symbolic  manipulation  system  MACSYMA.  An  outline  of 
such  a factorization  algorithm  is  as  follows  where  the  main 
difference  here  is  in  Step  (4)  where  we  apply  the  Multiple 
Factor  Generalized  Hensel  Algorithm  instead  of  using  the 
ordinary  GHA  r - 1 times.  Given  a multivariate  polynomial 

r (x.y y ) in  Ztx.y y ] 

1 v 1 v 

(1)  By  taking  content  and  primitive  part  of  F,  we  can  write 
F » cont(F)  pp (F ) and  factor  each  part  separately.  By  com- 
puting the  square-free  decomposition  of  F,  ue  can  write 
2 k 

F » F F ...  F and  factor  each  F separately  then  simply 
12k  i 


By 


!*x»V*A ip  *Mt MM«(g| 
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attach  the  multiplicities  of  tne  factors.  Thus  ue  uill 
assume  that  the  F to  be  factored  is  primitive  and  square-free. 


(2)  Choose  a set  of  integers  b • lb  ,...,b  l such  that 

1 v 


fix)  - F (x)  - Flx,b  b ) is  also  square-free  and 

b 1 v 


deg(f)  - deg(F)  in  x. 

(3)  Apply  the  univariate  factoring  algorithm  described 
in  Section  V-2  to  find  a factorization  of  f such  that 


fix)  ■ f (x)  f (x)  ...  f lx)  over  Z.  In  the  process  of  this 
1 2 r 


computation  ue  can  also  determine  a prime  p and  an  integer  m 
such  that  q » p**!2**m)  bounds  tuice  the  coefficients  of  any 


factor  of  F and  its  evaluation  at  b so  that  f over  Z is 


i 


identical  to  f Imod  q)  and  the  f 's  remain  pairuise  rela- 

i i 


tively  prime  modulo  q. 


(A)  Let  S - ly  -b  , . . . ,y  -b  I , then  ue  have 
11  v v 


Flx.y  ,...,y  ) — f (x)  ...  f lx)  Imod  S).  Apply  the  flul- 
1 v 1 r 


tiple  Factor  Generalized  Hensel  Algorithm  discussed  in  the 
last  section  on  these  univariate  factors  to  compute  pairuise 


relatively  prime  multivariate  polynomials  F (x,y  g ), 

i 1 v 


i-l,...,r,  corresponding  to  f (x)  such  that 

i 


F » F F ...  r (mod  S , q)  where  n • 1 + the  degree  bound 
1 2 r 

for  F in  :t9  non-wain  variables  y y . 

1 v 

15)  From  these  F 's,  all  true  irreducible  factors  of  F over 


Z can  be  computed  by  taking  combinations,  trial  divisors, 
and  the  restore-leading-coefficient  operations. 

The  cost  for  factoring  a primitive,  9quare-free  multi- 
variate polynomial  F,  then,  mainly  consists  o*  the  costs 
from  Steps  (3),  (4),  and  (5).  The  cost  of  S'.ep  !3)  is 

simply  for  factoring  the  univariate  polynomial  F (x)  into 

b 

f (x),  f (x) f (x)  which  was  discussed  in  Section 

1 2 r 

V-2.  Step  (4)  involves  the  application  of  the  Multiple 
Factor  Generalization  Hensel  Algorithm  in  the  last  section. 
According  to  the  brief  analysis  of  this  algorithm,  the  total 
cost  of  this  9tep  is  proportional  to 


T (F 

i 


) + (n/v)  (T(F)  +\  TCF  )). 


E1 


Step  (5)  is  very  much  dependent  on  the  choices  of  evaluation 
points  and  the  prime  modulus.  The  worst  case  is  when  F is 


irreducible  but  (mod  q,  S );  it  h39  r extraneous  factors. 


In  this  case,  it  will  take  as  many  as  2 ru I tipi  ications 


I 


i f 

i 
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of  the  extranecufj  factors  and  trial  divisions  to  confirm  the 
irreducibi I i ty  of  F.  In  practice,  not  very  many  trial 
operations  are  required  and  ths  true  factors  of  F are 
quickly  found.  For  each  true  factur  of  F,  then,  it  may  be 
necessary  to  restore  the  leading  coefficient,  and  that  in- 
volves a content  and  primitive  part  computation  on  some  com- 
bination of  the  potential  factors. 

Finally,  we  observe  that  because  of  the  high  cost  in- 
volved in  processing  an  extraneous  factor,  especially  in 
Steps  (4)  ar j (5),  it  seems  wise  and  worthwhile  to  repeat 
Steps  (2)  and  (3)  several  times  in  order  to  find  an  evalua- 
tion and  a prime  such  that  the  number  of  potential  factors 
is  minimal. 


K; 

? 


u.-iifi'.kSriLi.DL  &-V..CA  ...  ■■■■.  ■-  i -k.  ifY'lll  r'P  . . 
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CHAPTER  VI.  POLYNOfll AL  CONTENTS  AND  PRIMITIVE  PARTS 
VI  - 1 Introduction 

In  the  previous  chapters,  we  have  seen  ‘he  important 
parts  played  by  the  operation  of  computing  contents  and 
primitive  parts  of  polynomials.  Most  notably,  the  first 
step  of  both  the  factorization  and  the  GCD  algorithms  is 
mainly  performing  this  operation.  To  refresh  our  memories, 
any  polynomial  P in  JM,  with  J being  any  u. f.d.,  can  be 
rewritten  in  a unique  representation  P - cont(P)  pp CP ) ahere 
cont(P)  is  the  unit  normal  GCO  of  all  the  non-zero  coeffi- 
cients of  P ard  pp(P)  - P/cont (PI  is  the  remaining  primitive 
part  of  P.  Specifically,  a multivariate  polynomial  P in 

Zlx.y  ] will  actually  be  considered  and  represanted  in 

1 v 

(Z ly DM.  That  is  P is  considered  to  be  a polyr.o- 

1 v 

mial  in  x with  coefficients  as  polynomials  in  the  other 
variables.  The  importance  of  the  content  and  primitive  part 
operation?  comes  exactly  from  this  requirement  of  choosing  a 
main  variable  and  a unique  representation  of  polynomials 
with  respect  to  the  main  variable.  Among  the  basic 
arithmetic  operations  on  polynomials,  division  essentially 
makes  the  determination  of  a main  variable  mandatory. 

Factors  and  GCO’s  of  polynomials  are  in  fact  divisors,  hence 
the  particular  importance  of  the  computation  of  contents  and 
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primitive  parts  for  these  algorithms. 

In  general,  contents  and  primitive  parts  are  relative 
operations,  i.e.  they  are  operations  with  respect  to  a par- 
ticular variable,  cont(P)  • cont(P,  x)  for  some  variable  x. 
However,  there  are  at  least  two  other  kinds  of  useful 
notions  of  contents  and  primitive  parts  which  are  essen- 
tial ly  absolute  instead  of  relative  to  any  particular  varia 
ble.  One  such  is  often  called  the  term  content. 


tcont.  For  an  arbitrary  polynomial  P(x  ,x  x ), 

1 2 v 


kl  k2  kv 

tcont(P)  » x x ...  x , i.e.  the  monomial  which  is  the 
1 2 v 


product  of  the  variables  each  raised  to  the  power  which  is 
the  w.inimum  of  all  corresponding  variable  powers  in  all  the 
terms  of  the  completely  expanded  P.  In  other  words,  ac- 
cording to  Brown  in  a private  communication  pp (P)  is  now 
normal  or  it  is  not  divisible  by  any  monomial  except  1 or 
-1.  This  is  quite  an  easy  quantity  to  obtain  from  a polyno 
mial  P for  many  different  ways  of  representing  a polynomial 
Clearly,  if  a polynomial  is  representsd  in  a iystem  as  a 
collection  of  fully  expanded  terms,  tl  en  getting  its  term 
content  simply  involves  taking  minimum.!  c*  small  integers. 
Anothe*  such  concept  of  content  is  called  the  super 
content,  scont.  This  ia  computed  by  regular  contents  recur 
sively  on  each  variable  in  the  polynomial.  In  other  words, 
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spp(P(x  ,...,x  ))  - pp{... (pp(pp(P,  x ),x  ),...x  ) and 
1 v 1 2 v 

acont(P)  » P/spp(P).  Super  content  of  a polynomial  P 

contains  all  factors  of  P each  of  which  is  independent  of 

anyone  of  the  variables  in  P.  Thus  spp(P)  can  only  contain 

factors  which  have  all  the  variables,  or  not  divisible  by 

any  multivariate  polynomial  uhich  depends  on  feuer  than  a!' 

v variables  in  P. 

In  the  following  sections  of  this  chapter,  ue  will 
present  an  algorithm  for  computing  multivariate  polynomial 
contents  and  primitive  parts  using  the  Hensel  construction. 
Ue  will  compute  term  contents  of  the  coefficients  as  a very 
useful  preliminary  step  of  computing  regular  contents  with 
respect  to  a particular  variable.  Ue  uill  only  mention  here 
that  the  computation  cf  super  contents,  then,  simply 
involves  a recursive  applications  of  the  algorithm  for 
computing  regular  contents  on  each  variable. 

VI  - 2 The  EZCONTENT  Algorithm 

The  important  observation  to  be  made  is  that  the  Hensel 
construction  for  the  EZGCO  Algorithm  Step  IV,  Section  III-2 
is  applied  using  only  one  of  the  tuo  given  polynomials. 

That  being  the  case,  there  is  no  reason  uhy  the  EZGCD 
Algorithm  cannot  be  generalized  to  compute  the  GCD  of 
several  polynomials  at  once.  Because  the  entire  preparation 


for  Hensel  construction  in  EZGCO  Algorithm  consists  of 

evaluations,  univariate  GCD  computation,  and  some 

ver  'cation  of  the  conditions,  the  only  difference  in 

handling  several  polynomials  at  once  is  in  successively 

calculating  the  GCO  of  several  univariate  polynomials.  Thus 

ue  have  achieved  some  parallelism  for  dealing  uith  several 

multivariate  polynomials,  hence  the  so  called  semi-parallel 

process.  Since  the  content  of  a polynomial  u.r.t.  a 

particular  variable  is  simply  the  GCD  of  all  cf  the 

coefficients,  ue  have  a straight  foruard  generalization  of 

the  EZGCD  Algorithm  to  an  algorithm  for  computing  contents. 

Furthermore,  since  the  GCD  verification  step  of  the  EZGCD 

Algorithm  (Step  V,  Sect.  Ill -2)  computes  the  cofactors  as  a 

byproduct,  ue  can  easily  have  the  corresponding  primitive 

parts  as  a byproduct  of  the  content  computation. 

Ue  nou  give  an  outline  and  overvieu  of  this  generalized 

EZGCO  Algorithm  for  computing  contents  and  primitive  parts, 

the  EZCONTENT  Algorithm,  similar  to  those  given  in  Section 

1 1 1 -2.  The  detailed  algorithm  ui 1 1 clearly  be  quite  similar 

to  the  EZGCD  Algorithm  and  ui  1 1 be  omitted. 

The  EZCONTENT  Algorithm  uill  compute  the  content 

CMz.x.y  ,...,y  ) and  the  primitivt  rjrt  P’(z,x,y y ) 

I v 1 v 

uith  respect  to  the  variable  x of  a given  multivariate  poly- 

nom i a I F’  (z.x.y  ,...,y  ) in  ZIz.x.y  ,...,y  J . 

1 v 1 v 


Step  I:  (Term  Contents  of  Coefficients) 

Express  F*  as  F0’ (x,y  ) +Fl’(x,y  ) z + 

1 v 1 v 


...  + Fk*  (x,y  ,...,y  ) z . For  i - 1 k,  set 

1 v 

Ti  < — tcont(Fi’)  and  Fi (x,y  ,...,y  ) < — tpp(Fi’). 

1 v 

Note  that  some  of  the  Fi'  may  be  zero,  but  ue  assume  at 

least  two  of  the  coefficients  are  non-zero.  In  the  case 

where  only  one  coefficient  is  non-zero,  the  problem  is  of 

course  trivial,  that  coefficient  is  the  content  and  the 

corresponding  power  of  z is  the  primitive  part. 

Compute  and  3et  TC  < — gcd(F0,Fl,. . . , Fk) . Here,  since 

all  the  polynomials  are  actually  monomials  in  x,y  , 

1 v 

their  gcd  is  simply  the  product  of  variables  raised  to  the 

minimum  of  corresponding  pouers  in  each  monomial.  Thr 

cofactors  corresponding  to  TC  are  also  easy  to  compute  - by 

3Ubstraction  of  powers.  Thus  the  remaining  task  for  this 

algorithm  is  to  compute  C(x,y  ,...,y  ) gcd(F0,Fl, . . . ,Fk) 

1 v 

and  the  respective  cofactors  F0/C,  Fl/C Fk/C.  Then, 

the  content  C’  is  simply  TC  * C and  the  primitive  part 


P’  (z.x.y  y ) is 

1 v 


(T0/TC)  (F0/C)  + (Tl/TC)  (Fl/C)  z + 


• • • 


k 

+ (Tk/TC)  (Fk/C)  z . 
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The  reason  for  computing  all  the  term  contents  of  the  coef- 
ficients is  not  only  because  they  are  convenient  to  calcu- 
late but  also  because  ue  can  thus  eliminate  some  unnecessary 
blouup  problem  due  to  non-zero  evaluation  values  later.  Ue 
have  noted  before  that  non-zero  evaluations  cause  blouups 
in  the  number  of  terms,  hence  in  the  computing  cost  of  the 
GCD.  If  ue  can  reduce  the  degrees  of  the  variables  in 
each  coefficient  polynomial  by  this  term  content  operation, 
then  ue  can  also  hope  to  lessen  the  severity  of  the  blouups 
uhen  ue  compute  the  GCD  of  the  term  primitive  parts  of  the 
coefficient  uhich  are  normal. 

Step  II:  (Evaluation  and  Univariate  GCD) 

Choose  a set  of  v integers  b ■ lb  ,...,b  I such  that 

1 v 

the  degrees  in  x of  F0,  FI Fk  evaluated  at 

y -b  i - l,...,v  are  not  decreased  (i.e.  b is  valid 

i i 

for  F0,  FI Fk). 

Compute  F0  ,F1  Fk  and  C - gcd(F0  ,F1  ,...,Fk  ). 

h b b 0 b b b 

This  univariate  GCD  can  be  computed  using  UN I GCD  or  Modular 
GCD  Algorithms  serially  or  successively  on  pairs  of  polyno- 
mials, e.g.  gcd(. . . (gcd(F0  , FI  ),...),Fk  ).  If  UNIGCO  is 

b b b 

used,  then  a similar  generalization  of  that  algorithm  ui  1 1 
enable  it  to  handle  several  polynomials  at  once.  Houever, 


Wllljpil**,,  I I,  I 
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even  in  that  case  the  gcd  of  several  modular  images  would 
have  to  be  computed  serially,  so  that  our  parallel  computa- 
tion uill  eventually  end  in  a serial  calculation  in  a sub- 
domain. That  is  why  we  only  call  this  a semiparallel  pro- 
cess. 

Step  III:  (Preparation  for  Hensel  Construction) 

If  deg(C  (x))  - 0,  then  C (x)  - 1 and  C - 1.  If  Fj  is 

0 0 

the  polynomial  of  minimum  degree  in  x among 

F0,  ...  , Fk  and  deg(C  ) - deg(Fj),  then  either  Fj 

0 

divides  all  other  polynomials  or  a new  valid  evaluation 
should  be  made.  Otherwise,  determine  if  the  following 
condi tion  holds: 

Condition  III  - A:  There  exists  a j in  10,1 k) 

such  that  gcd(C  , Fj  /C  ) - 1. 

0 0 0 

If  the  condition  fails  to  hold  on  any  of  the  polynomials, 
then  the  special  case  method  which  is  also  similar  to  that 
oi  ‘he  EZGCD  Algorithm  will  have  to  be  used.  The  difference 
betueen  handling  two  or  several  polynomials  for  the  special 
case  is  again  so  small  that  we  will  omit  presenting  the 
method  here. 

Assuming  Condition  III -A  holds  for  some  j,  we  continue 

under  the  further  assumption, 

Condition  III  - B:  deg(C  (x> ) - deg(C). 

0 
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Again  similar  to  the  EZGCD  Algorithm,  this  C is  supposedly 
unknown  so  that  this  condition  cannot  be  tested  at  this 
point.  However  we  will  endeavor  to  increase  the  probability 
of  having  it  hold,  at  various  places  of  the  algorithm.  In 
addition,  Step  V will  provide  a safety-valve  test  to  ensure 
that  any  unlucky  evaluation,  which  causes  Condition  III-B  to 
be  false,  will  be  detected. 

Step  IV:  (Appl ication  of  the  Hensel  Construction) 

Suppose  Fj  satisfies  condition  III-A.  Apply  the  Hulti- 

variate  EZ  Algorithm  (II -5.4)  on  Fj,  C (x),  and 

F j (x)/C  (x)  to  get  ei ther 
b 0 

(a)  multivariate  cofactors,  C"  and  H",  of  Fj  such  that 
Fj  ■ C"  H"  over  Z,  or 

(b)  some  q , n,  C , and  H such  that 

k n n 

n 

F j - C H (mod  q , S ) . 


n n k 

Step  V:  (Verification  of  Results) 

For  case  (a),  test  whether  C”  divides  the  other  polyno- 
mials Fi  not  equal  to  Fj.  I f so,  C - C"  i s the  GCD  we 
seek,  so  that  C’  - TC  C is  the  content  we  seek  and  the 
primitive  part  can  be  computed  by  multiplying  quanti- 
ties already  computed.  Otherwise,  or  far  case(b),  go 
back  to  Step  II  for  a new  evaluation  and  a resulting 


• xi  . '■  ■ ■ - - 1. , : 
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univariate  GCD  having  a smaller  degree  than  this  C . 

0 


The  theoretical  justifications  for  this  algorithm  has 
esentially  already  been  provided  by  Theorems  III —2 . 2 and 
1 1 1-2.3.  Therefore,  ue  only  refer  to  them  keeping  in  mind 
the  generalization  to  considering  several  given  polynomials 
at  once. 


VI  - 3 Computing  Cost  Estimation  and  Conclusions 

Because  of  the  similarities  of  the  EZCONTENT  and  the 
EZGCD  Algorithm,  the  cost  of  computing  the  content  and 
primitive  par?  of  a v + 2 variable  polynomial 


P(z,x,y  ,...,y  ) is  about  the  same  as  computing  the  GCD  of 
1 v 


some  v + 1 variable  polynomials.  For  this  algorithm, 
assumptions  (ASfll)  - (ASM4)  nil  I also  be  made,  except  that 
(ASM3)  should  be  changed  to 

(ASM3’)  The  cost  of  computing  GCD’s  for  polynomials  of  v 

variables  or  less  is  negligible  compared  to  the  cos 
of  computing  the  GCD  and  cofactors  of  *erm  primi- 
tive parts  of  the  v + 1 variable  coefficient  poly- 
nomi a I s. 


Under  these  assumptions,  we  get  a computing  cost  estimate 
for  this  algorithm  parallel  to  (E3 ) of  the  EZGCD  Algorithm: 
T(sub((cl"/cl)  C)  T (sub (cl  HI)  + 


f T ( c I " Fj)  + T (sub ( (cl "/cl ) C)  + T (sub (c I H) ) ] (n/v) 
uhere  C - gcd(F0,  FI,  ...  , Fk),  cl  - lc(C),  cl"  is  the  GCD 

i 

of  the  leading  coefficients  of  F0,  FI,  ...  , Fk,  H is  the 
co-divisor  of  C in  Fj  uhich  is  one  of  the  Fi’s  satisfying 
Condition  III —A  and  used  for  the  Hensel  construction,  and 
n is  the  maxi  sum  term  degree  for  cl"  Fj. 

If  the  additional  assumptions  (ASMS)  and/or  (ASfIG)  are 
made,  ue  also  get  formulas  similar  to  (E4)  - (EG)  for  the 
EZGCO  Algorithm.  But,  more  importantly,  ue  also  get  an 
expression  similar  to  the  well  substantiated  formula  (E7) 
as  the  proportional  computing  cost  for  the  EZCONTENT 
A I gor i thm: 

T ( sub  ((cl"  'cl ) C)  T (sublet  H))  0T(sub(cl"  Fj)) 

Uith  this  formula,  ue  can  conclude  that  the  cost  for 
computing  content  and  primitive  part  of  a polynomial  Is 
roughly  proportional  to  the  number  of  terms  in  the  smallest 
coefficient  polynomial.  This  interpretation,  in  essence, 
gives  validity  to  assumption  (ASt13)  for  the  EZGCD  Algorithm. 

After  the  many  cases  of  empirical  examples  shoun  in 
Section  IV-3,  one  can  readily  observe  that  the  computing 
time  for  contents  and  primitive  parts  using  the  EZCONTENT 
Algorithm  can  be  much  faster  than  the  more  usual  uays  of 
serially  applying  other  GCD  algorithms.  Thus  ue  ui 1 1 not 
attempt  to  create  separate  examples  to  re-emphasize  this 


225 


point  by  exhibiting  additional  experimental  data. 

The  definition  for  GCO  of  several  polynomials  (Section 
1-2)  makes  no  stipulation  that  this  GCD  should  be  computed 
successive'y  and  pairwise.  In  fact,  according  to  the 
definition  it  is  more  natural  the  other  way,  because  the  GCD 
is  simply  the  greatest  common  divisor  of  all  the  polynomials 
taken  simultaneously  rather  than  taken  in  a specific  order. 
Ue  hope,  our  presentation  of  the  EZCONTENT  Algorithm  further 
demonstrated  and  emphasized  this  view  point.  The  polynomial 
content  operation  is  an  important  example  of  the  need  for 
parallel  GCD  computations  and  the  importance  of  the 
primitive  parts  shows  the  usefulness  of  getting  the 
cofactors  of  a GCO  as  byproducts. 


ifcWi  iiinmmiiirii  i ■-  --  — — -----  ■ 
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CHAP  TER  VII  SQUARE-FREE  DECOtIPOS  I T I ONS 


VII  - 1 Introduction  and  Overview  of  EZSQFR 


Square-free  decompositions  of  polynomials  have  many 


uses.  Among  them,  the  most  important  ones  are  in  polynomial 


factorization  and  partial  fraction  decn”;o9i tlon  of  rational 


functions  (refer  to  Appendix  for  definitions).  These  are, 


in  turn,  indispensable  tools  of  rational  function 


integrations  tf10SS7] , (M0S71J,  IH0R711,  IRISG9).  Horowitz 


IH0R71)  made  a quite  complete  survey  and  analysis  on 


computing  square-free  decompositions,  partial  fraction 


decompositions  and  rational  function  integrations  for  the 


univariate  case.  Many  existing  symbolic  manipulation 


systems  such  as  MACSYT1A  IHAC73)  and  SAC-1  (H0RG91  also 


contain  algorithms  for  computing  square-free  decompositions 


of  multivariate  polynomials,  flusser (MUS711  presented  a 


slightly  improved  version  of  the  above  mentioned  more 


classical  algorithm.  He  will  now  present  brief  outlines  of 


these  square-free  decomposition  algorithms  for  multivariate 


polynomials  and  then,  in  parallel,  give  a rough  overview  of 


the  new  EZSQFR  Algorithm  which  uses  the  Hensel  construction. 


Given  a multivariate  polynomial  P(x,y y ) in 

1 v 


Z (x, y ,...,y  1,  which  will  be  assumed  primitive  w.r.t.  x, 
1 v 


Preceding  page  blank 
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the  following  square-free  decomposition  algorithms  .Jill 

compute  square-free  polynomials  P (x,y  ),  i-l,...,k, 

i 1 v 

2 k 

(some  of  them  equal  1)  such  that  P ■ P P ...  P , for  some 

1 2 k 

k > 1 and  P / 1. 
k 

First  we  give  a simple  and  direct  proof  of  a funda- 
mental fact: 

2 k 

Theorem  VII  - 1.1:  l f P - P P . . . P i s pr i m i t i ve  i n 

1 2 k 


! 


except  one,  i (dP  /dx),  because  P ia  square-free  and 


2 k-1 

primitive  w.r.t.  x.  Thus  P P ...  P is  actually  the 

2 3 k 

greatest  common  divisor. 

(i)  Classical  Method  (Horowitz  and  Tobey) » 

Set  u gcd(P,  dP/dx),  P <--  H,  i <--  1. 

e 

Loops  Set  P <—  D,  □ <—  gcd(P,  rfP/dx),  P - P /P/D, 

i i-1 

i <—  i + 1,  Q’.  Loop  unless  D - 1. 

(ii)  Improved  Algorithm  (Musser)s 

Set  C « gcd(P,  dP/dx),  D <—  P/C  , i <—  1. 

1 1 1 

Loop:  Set  0 <--  gcd(C  , D ),  C <—  C /D  , 

i+1  i i i+1  i i+1 

P <—  0 /D  , i <—  i+1,  go  Loop  unless  D - 1. 
i i i +1  i 

(iii)  EZSQFR  (for  primitive  polynomials): 

Set  0 < — EZGCOIP,  dP/d>)  and  L <—  P/D. 

Let  b ■ (b  ,...,b  I be  the  lucky  evaluation  for  P used 
1 v 

in  the  above  GCO  computation.  Set  i < — 1. 

Loop:  Set  H <—  igcd(L  lx),  D (x)),  G <—  L /H  , 

0 b b 0 b 0 

0 <—  D /H  , and  L <—  H . If  G =1,  then 
b b 0 b 0 0 

set  P <—  1;  otherwise  G + 1,  then  apply  the 

1 0 


Multivariate  EZ  Algorithm  on  L,  G , and  H to 

0 8 

get  multivariate  G and  H such  that  L • G H,  and 

sat  P < — G and  L < — H.  Set  i < — i + 1.  If 

i 

D (x)  / 1,  go  Loop,  else  set  P <--  L and  return 
b i 

P , P .....  P uhere  k - i. 

1 2 k 

Upon  more  careful  examination  of  these  three  algori- 
thms, it  is  quite  clear  that  the  difference  between  algori- 
thm (i)  and  ( i i ) is  rather  small.  Some  unnecessary 
differentiations  are  done  in  (i),  and  the  computations  of 
GCD’s  in  (ii)  in  general  involves  smaller  polynomials 
than  in  (i).  The  GCO  computations  in  the  Loop  of  (i) 
and  (ii)  all  involve  mu! .ivariate  polynomials.  They  can 
be  very  time  consuming.  If  polynomial  GCO  Algorithms,  such 
as  the  Modular  and  EZGCO  Algorithms  which  compute  the  co- 
factors as  byproducts,  are  used,  then  both  (ii)  and  (iii) 
involves  no  polynomial  divisions  at  all  uhere  as  (i)  has  to 

do  one  division  for  each  P . In  addition,  the  most  impor- 

i 

tant  advantage  that  EZSQFR  Algorithm  has  over  the  other  tuo 
methods  is  that,  except  the  first  GCD  computation,  all  other 
GCO’s  are  univariate  operations.  In  fact,  uithin  the  Loop 
of  EZSQFR  Algorithm,  the  only  multivariate  operations  are 
done  on  L in  the  application  of  the  Multivariate  EZ  Algo- 
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rithm  for  constructing  G and  H.  Therefore  we  will  now  die- 
cuss  this  EZSQFR  Algorithm  for  computing  multivariate  poly- 
nomial square- free  decompositions  in  more  detail. 

VII  - 2 The  EZSQFR  Algorithm 

From  the  outline  of  the  last  section,  It  is  clear  that 
the  EZSQFR  Algorithm  is  also  based  on  the  evaluation 
homomorphism  and  the  Hensel  construction.  As  we  have  seen 
for  the  EZGCQ  Algorithm  in  Chapter  III,  it  is  quite  obvious 
that  the  modular  homomorphism  and  the  Zassenhaus’  Quadratic 
Extension  Algorithm  can  be  used  for  computing  square-free 
decompositions  of  univariate  polynomials,  similar  to  the 
multivariate  situations.  Again,  we  will  not  discuss  the 
univariate  case  in  any  detail,  due  to  the  ample  similarities 
between  the  tuo  cases. 

For  the  multivariate  EZSQFR  Algorithm  we  will  first 
1'iscuss  a useful,  timesaving  device  for  the  computation  of 
square-free  decompositions,  due  to  a suggestion  by  Uang.  A 
given  multivariate  polynomial  could  already  be  square-free 
with  respect  to  a particular  variable.  If  this  fact  can  be 
detected,  at  a relatively  small  cost,  before  the  entire 
machinery  of  the  square-free  algorithm  begins  to  work,  there 
can  be  big  savings  in  computing  time.  There  is  such  a time 
saving  test  which  we  shall  call  fai I -safe  square-free  test 
(f.s.s.f.).  This  test  consists  of  (a)  evaluating  the 
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polynomia,  at  random  by  chosen  valid  points,  (b)  computing 

the  GCO  of  the  resulting  univariate  polynomial  and  its 

derivative  w.r.t.  the  main  variable,  (c)  checking  to  see  if 

the  GCO  is  a integer  or  not.  if  so  then  the  original 

polynomial  is  square-free.  The  validity  of  this  test  is 

shoun  by  the  following  lemmas 

Lemma  VII  - 2.1s  Let  b ■ lb  ,...,b  ) be  an  arbitrary 

1 v 

valid  evaluation  for  P(x,y  ,...,y  ) in  Zlx.y  ].  If 

1 v 1 v 

P (x)  - P(x,b  ,...,b  J is  square-free  in  ZtxJ  then  P is 
b 1 v 

itself  square-free. 

2 

Proof:  If  P is  not  square-free,  P - Q R , then  via 

2 

the  evaluation  homomorphism  P ■ Q R which  cannot  be 

b b b 

square-free.  // 

The  usefulness  of  this  test  comes  from  the  next  lemma,  uhich 

is  also  stated  and  proved  by  Uang  and  Rothschild  IU&R73) : 

Lemma  VII  - 2.2s  If  P is  a square-free  multivariate 

polynomial  in  Zlx.y  ,...,y  ],  then  a set  of  integers 
1 v 

b -■  lb  ,...,b  1 can  be  chosen  so  that  b is  a valid  evalua- 
1 v 

tion  for  P (i.e.  deg  IP)  - deg  IP  (x) ) and  P (x)  is  also 

b b 

square-free. 
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j 

i 

i 


Proof:  Let  P ■ PI  P2  ...  Pk  be  the  factorization  of  P 

into  irreducible  factors  where  gcdIPi,  Pj)  ■ 1 for  all  i and 

j 9uch  that  i f j.  P (x)  ie  equare-free  if  and  only  if 

b 

Pi  (x)  i 9 square-free  for  all  i and 
b 


tjccKI’l  (x),  t’j  (k)I  « constant,  for  alt  I,  J,  I f J,  which 
b b 

is  equivalent  to  rea I t (Pi  (x),  Pj  (x))  / 8,  for  all  i,  j, 

b b 

i / j,  uhere  "resit"  denotes  the  resultant  u.r.t.  x [VDU491 . 
Let  R(x,y ) -IT  res  1 1 (P i . dPi/dx)  | | res  1 1 (Pi , Pj). 


1 


I I 


l<j 


Then  R / 0 since  P i9  square-free  and  P (x)  is  square-free 

b 


if  and  only  if  R (x)  / 0.  Now  it  suffices  to  note  that 
b 


there  are  only  finitely  many  integral  values  for 

b , ...  , b such  that  R (x)  ■ 0 or  ( I c (P) ) » 0.  // 

1 v b b 

The  finiteness  of  integral  roots  for  R(x,y  y ) has 

1 v 

the  further  implication  that  out  of  the  infinite  possibi- 
lities of  integral  values  for  each  b , the  probability  for 

choosing  a valid  but  unlucky  set  b,  such  that  P (x)  becomes 

b 

not  9quarc-fres  uhen  P i s,  cannot  be  too  large.  He  will 
apply  this  test  at  the  beginning  of  the  EZSQFR  Algorithm 
with  at  most  tuo  valid  evaluations.  If  the  test  fails,  we 
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simply  assume  no  information  was  gained  and  continue  with 
the  rest  of  the  EZSQFR  Algorithm.  Houever,  we  point  out 
that  not  all  the  work  done  for  the  test  was  wasted  in  this 
case,  since  the  computations  done  for  the  f.s.s.f.  teet  is 
useful  for  the  EZGCO  Algorithm  when  it  ie  used  to  compute 
the  square-free  part  of  the  given  polynomial  (the  D of  tiii) 
in  the  last  section). 

Next,  we  will  prove  Lemma  (SI)  already  stated  in 

Section  1 1 1 -4  uhich  will  eliminate  the  need  for  using  the 

spe:ial  case  method  of  the  EZGCD  Algorithm  when  it  is 

invoked  to  compute  □ - gcd(P,  dP/dx). 

Lemma  VII  - 2.3:  (same  as  1 1 1 — (SI ) ) 

Let  P be  primitive  in  (Z(y  , ,.,,y  l)(x]  and 

1 v 

R - ycdlP,  dP/dx),  then  gcd(R,  (dP/dx) /R)  - 1. 

2 k 

Proof-  Let  P - P P ...  P and  Q - dP/dx.  By  Theorem 
1 2 k 

1 2 k-1 

VI I -1.1,  R - P P ...  P . Assume  D - gcd(R,  Q/R)  / 1, 

1 2 k 

then  Q » R D Q"  f c.  some  Q".  Since  D divides  R,  there  must 

be  an  irreducible  factor  C / 1 of  some  P , i > 2.  But 

i 

this  C clearly  also  divides  P/R  so  that  P « R C P"  for  some 
P".  Thus,  R C divides  both  P and  Q,  contradicting  to  R 
boing  the  GCO.  Therefore,  gcd(R,  Q/R)  must  be  1.  // 


iiriiiiaiiiiaiiiMiiiiii  mi- 


This  lemma  and  the  finiteness  of  the  number  of  unlucky 
evaluations  imply  that  Condition  1 1 1 -A  of  the  EZGCO  Algo- 
rithm can  aluays  be  satisfied  for  the  polynomial  dP/dx. 
Hence,  only  Steps  (Ai)  - (AS)  of  the  EZGCO  Algorithm  uill  be 
used  for  computing  gcdtP,  dP/dx)  and  the  special  case  method 
Steps  (SI)  - (S7I  can  be  completely  avoided  for  this  square- 
free  decomposition  algorithm. 

He  are  ready  nou  to  describe  the  EZSQFR  Algorithm  in 
detail.  However  ue  uill  not  be  so  careful  as  to  indicate, 
for  example,  how  the  computations  for  f.s. s.f.  test  is  used 
in  the  EZGCO  Algorithm  cr  uhat  is  altered  in  the  EZGCD 
Algorithm  when  it  is  known  that  Condition  1 1 1 -A  uill  hold 
for  one  input  polynomial  eventually.  These  changes  are 
actually  very  simple  if  ue  carefully  look  at  the  EZGCD 
Algor i thm  again. 

If  the  given  polynomial  is  not  primitive,  then  dearly 
ue  can  uork  on  its  content  and  primitive  part  separately. 
Because  of  the  fad  that  square-free  decompositions  are  main 
variable  dependent  and  that  for  some  uses  it  is  not 
necessary  to  square-free  decompose  the  content,  ue  uill 
assume  primitive  inputs  to  the  EZSQFR  Algorithm. 

Algorithm  VII  - 2.4:  (EZSQFR) 

Input:  A primitive  multivariate  polynomial 

P(x,y  ,...,y  ) in  Z[x,y  ,...,y  ) and  the  main  variables,  x. 

1 v 1 v 


— — 
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Output:  A list  of  polynomials  in  Z[x,y ], 

1 v 

2 k 

P , P , ...  , P , such  that  P ■ P P ...  P for  some  k > 1 
12  k 12k 

and  P / 1. 
k 

(1)  invoke  the  f.s.s.f.  test  twice  with  random  valid  eva- 
luations. if  the  test  is  successful,  then  return  P, 

■ otherwise  continue. 

(2)  Apply  the  EZGCD  Algorithm  on  P and  dP/dx  to  obtain 

D * gcd(P,  dP/dx)  and  L - P/D.  Let  b be  the  valid 

lucky  evaluation  for  P and  P be  the  lucky  prime  for 

P (x)  used  in  the  EZGCD  Algorithm  above, 
b 

(3)  Set  Lb  <—  L(x,b  ,...,b  ),  Db  <—  Dix.b b ),  and 

1 v 1 v 

i <--  1. 

(4)  Apply  UNIGCD  Algorithm  (or  some  other  univariate  GCD 
Algorithm  which  also  computes  cofactors)  on  Lb  and  Db 
to  obtain  H0  <—  gcdiLb,  Db),  G0  <--  Lb/H0,  and 

Db  < — Ob/H0.  Set  Lb  <—  H0. 

(5)  . i f G0  » 1,  then  set  P < — 1.  Otherwise,  apply  the 

i 

Multivariate  EZ  Algorithm  on  L,  G0,  and  H0  to  get  mul- 
tivariate polynomials  G and  H such  that  L - G H over  Z. 
Set  P <--  G,  L < — H,  and  I < — i + 1. 


... 
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(G)  If  Ob  ■ 1,  then  go  to  14).  Otherwise,  set  k < — i, 

P <--  L,  and  return  P , P , , P . 

k 12k 

Remark:  Note  that  even  the  evaluation  of  multivariate 

polynomial  L and  0 in  Step  (3)  need  only  be  done  once.  The 

later  values  for  Lb  and  Ob  are  simply  updated  with  known 

quantities.  Step  (4)  involves  only  one  univariate  GCD 

computation.  For  the  application  of  the  Multivariate  EZ 

Algorithm  in  Step  (5),  the  required  lucky  evaluation  b and 

prime  p is  again  provided  by  the  EZGCO  Algorithm  in  Step 

(2).  Thus  the  results  of  that  computation  are  certain  to  be 

correct  over  Z.  Also,  it  is  possible  to  avoid  the  leading 

coefficient  blowup  problem  by  applying,  instead  of  the 

Multivariate  El  Algorithm,  Algorithm  1 1-5.5  in  a way  similar 

to  that  of  the  EZGCD  Algorithm  in  Section  III-4(a).  Here  ue 

can  use  the  successively  decreasing  leading  coefficient  of  L 

as  the  leading  coefficient  to  be  forced  onto  G0.  That  is 

instead  of  using  L,  G0,  and  H0,  we  use  ( I c (L) . L), 

C I c (L) ) (G0/lc(G0)),  and  lc(G0)  H0  together  with  I c <L)  as 

b 

the  leading  coefficient  replacing  leading  coefficients  of 
both  univariate  polynomials. 


VI I - 3 Conclusions 

It  is  quite  obvious  that  the  EZSQFR  takee  great  advan- 


tages  of  computations  already  performed  to  achieve  added 
efficiencies.  By  performing  Hense I constructions  instead  of 
full-fledged  multivariate  GCD  computations,  the  gains  in 
efficiency  is  clear.  Even  the  preparations  for  these  appli- 
cations of  the  Multivariate  EZ  Algorithm  are  very  simple. 
They  involve  essentially  only  one  univariate  GCD  computation 
as  preparation  for  each  Hensel  construction,  hence  for  each 

P . That  is  much  less  costly  than  uhat  EZGCD  Algorithm  goes 
i 

through  for  preparing  Hensel  construction.  All  the  choosing 
of  lucky  evaluations  and  primes,  testing  for  various  condi- 
tions et  ...  are  eliminated  because  the  evaluation  values  and 
prime  are  known  to  be  lucky  after  the  first  call  to  EZGCD. 

3 

Whenever  P - 1 for  some  i (for  example,  P - U V , then 
i 

P - U.  P - 1,  P - v),  the  computational  process  is  even 
1 2 3 

simpler  - only  one  univariate  GCO  computation  is  necessary 

and  no  multivariate  operation  is  required  at  jil.  Compared 

to  method  (ii),  we  would  find,  in  this  case, 

D - gcd(C  . D ) - 0 , so  that  P - D /D  - 1,  i.e.  D 
i+1  i i i i j j+i  ; 

divides  C . Even  so,  there  is  still  at  least  one  multiva- 
i 

riate  division  involved,  which  is  more  time  consuming  than  a 
univariate  GCO  computation  in  most  cases.  In  conclusion, 
the  EZSQFR  Algorithm  contains  all  the  time  saving  devices  in 
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the  other  two  methods  and  several  more  in  addition. 
Therefore,  the  decrease  in  computing  costs  should  be  rather 
obvious.  Ue  will  not  illustrate  this  uith  empirical  tests 
but  only  re-emphasize  the  following  facts:  (1)  Hensel 
constructions  have  been  shown  to  be  an  efficient 
computational  techniqua  whenever  applicable  for  practical 
multivariate  polynomials,  (2)  the  EZSQFR  Algorithm  needs 
only  one  call  to  EZGCO,  no  multivariate  divisions  at  all, 
and  several  Hensel  constructions  replacing  complete 
multivariate  GCD  computations,  and  (3)  empirical  data  in 
Section  IV-3  have  shoun  clearly  how  the  Hensel  approach  to 
GCD  computations  can  be  advantageous  over  other  methods. 


... 


CHAPTER  VII 1.  CONCLUSIONS  ANO 


SUGGESTIONS  FOR  FUTURE  RESEARCH 

VIII  - 1 Summary  and  Conclusions 

At  this  point,  we  hope  that  the  usefulness  of  the 
Hensel-type  constructions  in  symbolic  and  algebraic  manipu- 
lation has  been  made  clear.  Ue  have  shown  how  far  a rela- 
tively simple  idea  in  p-adic  analysis  can  go  toward  improv- 
ing the  performance  of  algebraic  algorithms.  A great  deal 
of  effort  has  been  given,  in  Chapter  II,  to  clarify  this 
fundamental  concept  and  its  many  generalizations.  The  basic 
idea  in  these  Hensel  constructions  is  to  recover  the  co- 
divisors of  a given  polynomial  in  a higher  domain  from  their 
algebraically  independent  images  in  some  more  structured 
subdomain,  flany  interspersed  examples  have  been  given  to 
help  in  understanding  these  Hensel-type  algorithms.  The 
Generalized  Hensel  Algorithm  for  multivariate  constructions 
brought  back  a seldom  used  but  very  appropriate  way  of 
viewing  a multivariate  polynomial  - as  a generalized  Taylor 
series.  Actually,  that  is  also  a natural  generalization  of 
the  p-adic  representation  for  integers.  The  only  difference 
is  that,  instead  of  using  a integral  prime  as  the  basis  for 
representation,  ue  use  the  degree-one  irreducible  polyno- 
mials of  the  form  (y  - b) , with  some  choser  integral  evalu- 
ation po i n t b fur  the  variable  y.  Although  we  have  noted 

Preceding  page  blank 
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the  difficulties  related  to  transforming  the  polynomials 
from  one  representation  to  the  other,  ue  also  have  realized 
the  gains  in  efficiencies  for  many  algebraic  algorithms  uhen 
the  Taylor  series  forms  are  used.  The  major  emphasis  to  be 
made  here  is  that  only  one  set  of  evaluation  values  are 
necessary  for  these  suitable  applications  of  the  GHA  when 
these  integral  values  are  well  chosen,  and  that  is  exactly 
the  reason  for  the  efficiencies  of  the  algorithms  described 
in  this  thesis. 

The  main  application  of  the  Hensel  construction  is  in 
the  problem  of  computing  polynomial  greatest  common  divi- 
sors. This  is  a very  well -studied  topic  ever  si.nce  the  time 
of  Euclid  (300  B.C.).  In  Chapter  I,  ue  pointed  out  the 
major  draubacks  of  knoun  methods  for  computing  GCO’s.  Ue 
noted  that  the  method  based  upon  tt*  recently  quite  popular 
modular  technique  - the  flodula’  GCO  Algorithm,  actually  only 
gave  us  a "uorst  case"  algorithm.  That  is,  it  essentially 
considers  all  polynomials  to  have  the  biggest  size  possible 
- dense  up  to  the  same  maximum  degree  in  each  variable. 
Computational  experience  tell  us  that  this  is  a very 
unrealistic  and  impractical  assumption  for  GCO  calculations. 

Rational  function  manipulation  and  simplification, 
unich  is  the  basis  many  pouerful  recent  symbolic  manipula- 
tion systems,  in  most  cases  depends  on  the  computation  of 
polynomial  GCD’s.  The  storage  capacity  and  speed  limita- 


tion9  of  present-dciy  computers  impose  bounds  on  the  sizes  of 
expressions  for  wh  h any  non-t-iviai  computations  can  be 
done.  As  far  as  GCD  computations  are  concerned,  we  saw  in 
Section  IV-3  that  the  largest  dense  polynomials,  for  which 
their  GCD’s  can  be  computed  within  reasonable  time  periods, 
have  on  the  order  of  2000  terms  and  those  are  only  dense 
quadratic  polynomials  in  four  variables.  Thus,  if  every  GCO 
computation  assumes  dense  polynomials,  it  would  be  very 
difficult  to  do  any  non-trivia!  calculations  with  multi- 
variate polynomials  of  high  degrees  with  a symbolic  com- 
puting system.  Fortunately,  most  multivariate  polynomials 
of  any  high  degree  are  sparse.  The  EZGCD  Algorithm  based  on 
the  Hensel  construction  takes  advantage  of  the  sparseness  of 
polynomials  whenever  possible  and  achieves  remarkable 
efficiencies  for  many  classes  of  problems.  Some  of  the 
empirical  test  cases  of  Section  1 V— 3 showed  this  efficiency 
very  well.  Ue  should  also  point  out  that  the  HACSYMA  sym- 
bolic manipulation  system  has  been  able  to  perform  many 
previously  nearly  impossible  (in  time  and  space)  computa- 
tions uith  the  newly  implemented  EZGCD  Algorithm. 

Uith  these  computational  expediences  and  the  test  case 
data  listed  in  Section  IV-3,  we  were  able  to  verify  the  use 
fulness  and  correctness  of  the  EZGCO  Algorithm  with  analy- 
tically derived  computing  cost  estimation  formulas.  Chapter 
’V  shows  the  accuracy  of  the  predictions  that  we  were  able 
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to  wake  with  the  intuitively  derived  expressions  for  com- 
puting costs.  Because  of  these  accurate  predictions  of 
actual  run  times,  our  confidence  in  this  wore  Intuitive 
(more  "engineering")  approach  to  computing  time  analysis  has 
been  strengthened.  As  already  noted,  Gentleman  CGEN73]  has 
independently  arrived  at  a similar  conclusion  that  an 
engineering  approach  to  computing  time  analysis  Is  deeirable 
for  complex  algorithms.  Ue  can,  therefore,  conclude  that  in 
the  absence  of  unlucky  evaluation  values  and  non-trivial 
leading  coefficients  causing  severe  blowup  problems  due  to 
non-zero  substitutions,  and  when  special  case  considerations 
are  not  necessary,  the  EZGCO  Algorithm  generally  surpasses 
the  Modular  GCD  Algorithm  in  performance.  Even  in  those 
cases  where  the  Modular  Algorithm  is  supposed  to  be  at  ite 
be9t,  such  as  uhen  the  GC0  is  1 or  for  moderately  dense 
situations,  the  Hensel-type  algorithm  can  perform  equally 
well.  The  only  cases  where  the  Modular  GCD  algorithm  should 
theoretically  win,  are  the  extremely  large  and  denee  caeee 
where  the  multiplication  cost  becomes  a significant  part  of 
or  even  dominates  the  cost  of  Hensel  construction.  But,  as 
the  data  in  Section  IV-3  showed  nearly,  such  problems  soon 
become  prohibitively  large  so  as  to  be  impractical  for 
computation  by  any  method.  So,  we  should  concentrate  mainly 
on  problems  of  a more  practical  size.  In  this  eize  range, 
it  is  possible  for  the  PRS  GCD  algorithms  to  be  more  effi- 
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cient  than  the  EZGCO  Algorithm  in  small  pfoblems  where  the 
number  of  elements  in  the  PRS  sequence  or  the  number  of 
pseudo-divisions  is  very  rr.all  (e.g.  1 or  2 divisions)  so 
that  it  is  impossible  for  large  blouup  to  occur  due  to  large 
pseudo-di vision  multipliers.  Since  a large  number  of  GCO 
problems  deal  with  small  polynomials  (because  the  computa- 
tion of  GCO's  and  contents  is  essentially  a recursive 
process  on  the  variables),  we  believe  that  a combination  of 
the  Reduced  (or  Subresultant)  PRS  GCO  Algorithm  ar.J  the 
EZGCD  Algorithm  will  be  very  advantageous.  For  example,  we 
could  test  if  the  minimum  of  degrees  of  the  two  input  poly- 
nomials is  less  than  or  equal  to  2,  recursively,  each  time 
the  main  variable  changes.  If  so,  the  PRS  GCO  Algorithm  can 
be  used  since  at  most  two  pseudo-divisions  need  to  be  per- 
formed in  this  case.  Otherwise,  we  would  apply  the  EZGCD 
Algorithm.  In  this  manner,  we  can  take  advantage  of  the 
best  aspects  of  both  algorithms  and  achieve  added 
e f f i c i ency. 

The  drawbacks  of  the  EZGCO  Algorithm  should  also  be 
clearly  recognized.  First,  there  is  the  problem  of  unlucky 
primes  end  evaluations  But  this  problem  ie  usually  less 
severe  for  the  EZGCD  Algorithm  than  for  the  Hodular  GCO 
Algorithm  because  the  EZGCO  Algorithm  only  needs  one  lucky 
prime  and  one  iucky  evaluation  value  for  each  non-main 
variab'o.  Uith  careful  screening  of  the  chosen  integral 


values  and  several  intermediate  teste  for  the  luckiness,  as 
ue  have  designed  into  the  algorithm,  thie  problem  can  be 
largely  avoided  at  a very  email  cost.  Next,  there  ie  the 
problem  of  the  desired  GCO  being  not  relatively  prime  to 
either  of  its  codivisors  in  the  two  given  polynomials  (l.e. 
Condition  1 1 1 -A  of  Section  III -2  can  not  be  satisfied). 

For  that  ue  have  designed  the  special  case  method  which 
still  uses  the  Hensel  construction  but  in  a more  indirect 
uay.  Although  this  method  is  slouer  by  being  round-about, 
it  turns  out  that  it  still  performs  quite  competitively  by 
taking  advantage  of  the  special  structures  of  the  given 
polynomials,  as  Case  7 of  Section  IV-3  bears  out.  In  such  a 
case,  the  EZGCD  Algorithm  has  the  blouup  problem  due  to  non- 
trivial leading  coefficient  uhen  applying  the  Hensel  con- 
struction. As  discussed  in  Sect  ions  1 1 -5 (a)  and  III -4 ( a) , 
this  problem  is  caused  by  the  non-uniqueness  of  the  solu- 
tions to  Oiophantine  Univariate  Polynomial  Equations  (DUPE) 
uhich  results  in  arbitrary  multiplications  of  the  codivisors 
by  units  in  the  modular  domain.  This  phenomenon  is  actually 
common  to  most  modular  algorithms  and  our  solution  for  this 
problem  for  the  GCD  computation  Is  someuhat  similar  to  the 
uay  of  getting  around  this  problem  used  by  the  Modular  GCO 
Algorithm.  At  the  cost  of  computing  the  GCD  of  leading 
coefficients  of  the  given  polynomials  and  using  the  product 
of  this  GCD  and  one  original  polynomial  (this  is  uhat  the 
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Modular  GCO  Algorithm  a iso  uses)  for  the  GHA,  we  can  avoid 
this  blouup  problem  by  forcing  the  results  from  the  Hensel 
construction  to  be  uniquely  determined.  Finally,  we  have 
the  yet  unresolved  blouup  problem  duo  to  the  combination  of 
the  requirement  for  valid  and  lucky  evaluational  values  and 
the  need  for  making  a substitution  transformation  on  the 
given  polynomial  for  the  Hensel  construction.  Uhen  the 
evaluation  values  are  forced  by  the  validity  and  luckiness 
requirements  to  be  non-zero,  the  substitution  transformation 
can  increase  the  number  of  terms  in  the  polynomial  for  GHA 
drastically  (up  to  that  of  the  dense  polynomial  uith  the 
same  maximum  degree  in  each  variable,  of  course).  This  is 
the  so  called  the  non-zero  substitution  problem  for  uhich  a 
generally  applicable  solution  is  still  lacking.  Ue  point 
out  that  there  are  various  uays  for  avoiding  such  blouup 
problems  in  some  special  situations.  But,  because  of  the;r 
lack  of  general  applicability,  ue  uill  not  discuss  them  in 
this  thesis.  Ue  uill  emphasize  that  even  assuming  the  uorst 
such  blouup  (to  dense  polynomials),  the  computation  times 
for  practical  sized  problems  are  still  competitive  uith 
those  of  the  Modular  GCD  Algorithm,  as  ue  can  see  from  Case 
5'  in  Section  I V-3  and  by  verifying  uith  the  corresponding 
computing  time  formulas  for  both  algorithms. 

The  last  three  chapters  dealt  with  three  other  applica- 
tions of  the  Hensel  constructions.  The  application  of 
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Hensel  construction  to  the  problem  of  factoring  polynomials 
was  actually  the  initial  work  In  this  direction.  However 
the  multiple  factor  generalized  Hensel  construction  in 
Section  Y-3  can  provide  a noticeable  improvement  to  the 
multivariate  polynomial  factorization  algorithm  when  the 
number  of  potential  factors  is  bigger  than  two.  The  next 
two  applications  of  Hensel  construction  are  actually  out 
growths  from  the  E2GCD  Algorithm  based  essentially  on  two 
key  observations  while  designing  and  implementing  the  GCD 
algorithm.  Realizing  that  the  computation  of  contents  is 
just  the  same  as  computing  the  GCD  of  several  polynomials 
together  and  that  the  application  of  the  Hensel  construct  ior 
needs  only  one  of  the  given  polynomials,  we  can  immediately 
extend  the  EZGCO  A I g-  <j  compute  polynomial  contents 

and  primitive  parts.  The  square-free  decomposition  algo- 
rithm - EZSQFR  is  a direct  extension  of  the  special  case 
method  of  the  EZGCO  Algorithm  for  handling  non-square-freo 
polynomials.  The  key  observation  in  this  case  is  Lemma  VII- 
2.3  which  ensures  the  applicability  of  the  Hensel  construc- 
tion to  the  problem  of  computing  square-free  decompositions 
for  all  polynomials.  A careful  study  of  the  known  square- 
free  decomposition  algorithms  revealed  several  inefficien- 
cies of  these  methods  uhich  the  EZSQFR  Algorithm  can  avoid. 
These  computational  improvemtnts  make  the  EZSQFR  Algorithm 
unquestionably  more  efficient  than  the  other  known  methods. 


VIII  - 2 Prospects  and  Suggestions 

The  vast  possibilities  of  applying  Hensel-type  methods 
are  certainly  not  exhausted  by  this  thesis.  The  univariate 
C.CO  algorithm  discussed  in  Section  II I -3  has  already  pro- 
Vdknd  i»r I I vp  n Indy  and  analysis  of  o'h-r  major  CCD  algori- 
thms for  univariate  polynomials.  Many  interesting  results 
have  been  discovered  by  comparing  GCO  algorithms  using  the 
ordinary  Hensel’s  construction  (Algorithm  1 1-3.1)  and 
Zassenhaus’  Quadratic  Extension  Algorithm  (1 1-3.2)  with 
other  known  methods.  A paper  by  Yun  and  fliola  reporting 

s 

these  activities  is  under  preparation. 

A I inear  version  of  the  Hensel  construction  is  mani- 
tested  in  the  problem  of  solving  the  polynomial  equation 
A F + B G » H wi th  F,  G,  and  H being  the  given  polynomials 
and  A.  B the  unknowns.  The  simplest  such  problem  Is  des- 
cribed in  Lemma  1 1-2.2  and  solvable  by  Algorithm  1 1-2.2. 
Under  suitable  conditions,  such  equations  can  even  be  solved 
for  multivariate  polynomials  over  t>  - integers.  Work  in 
this  aspect  of  utilizing  Hensel-type  methods  has  already 
been  initiated  and  will  be  further  pursued  by  Yun  in  the 
near  future.  Two  immediate  major  applications  of  this  work 
lie  in  the  problems  of  polynomial  division  with  remainders 
(or  pseudo-division)  and  in  partial  fraction  decomposition 
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of  rational  functions.  Ue  will  only  point  out  here  that, 


the  main  variable  is  quite  arbitrary.  As  ue  can  clearly 
realize  in  GCO  computations,  however,  the  choice  of  a main 
variable  can  greatly  affect  the  computing  time.  With 
respect  to  the  EZGCO  Algorithm,  for  instance,  different  main 
variables  result  in  different  leading  coefficients  hence  in 
different  lucky  evaluation  values  and  more  or  fewer  zeros 
for  the  substitution  transformation.  Up  to  now,  the  choice 
of  main  variable  seems  to  be  more  of  an  art  thin  a science. 
Thus,  it  will  be  a very  worthwhile  undertaking  to  see  if  a 
more  methodical  way  of  choosing  a main  variable  can  be  found 
for  the  EZGCO  Algorithm,  the  other  GCO  algorithms,  and  other 
polynomial  operations. 

(3)  The  multiplication  of  multivariate  polynomials 
seems  to  be  far  from  being  well  understood.  For  dense 
polynomials,  the  method  of  fast  finite  Fourier  transform 
achieves  the  currently  known  minima!  theoretical  computinn 
time  - T (P*Q) log(T(P*Q))  IB0N73] . For  the  completely  sparse 
case,  the  best  known  method  has  a computing  time  propor- 
tional to  T(P*Q) I og (T (P) ) (ALTRAN  [3R072] ) when  sorting  of 
terms  in  the  product  into  a canonical  representation  is  also 
considered.  Furthermore,  ue  believe  that  a even  better 
method  (cost  proportional  to  T(P)  T (Q) ) can  be  found  if  the 
ordering  of  terms  in  both  input  polynomials  are  taken  into 
consideration.  Then,  what  about  intermediately  sized  poly- 
nomial mu'  tip! ications?  Are  these  known  methods  actually 
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optimal?  Certainly,  an  understanding  of  polynomial  multi* 
plication  is  extremely  important  for  the  algorithmic  studies 
of  more  complex  algebraic  operations. 

(4)  Polynomial  division  is  equally  as  important  as 
multiplication.  As  mentioned  above,  a variation  of  the 
Hensel  construction  already  provides  a neu  way  of  performing 
a division  with  remainder.  A byproduct  of  that  method  is  a 
fail-safe  way  of  doing  test-division  which  is  an  important 
verification  step  of  the  EZGCD  and  the  Modular  GCD  A I go- 

r i thms. 

(5)  Polynomial  divisions  and  pseudo-divi s ions  lead 
naturally  to  PRS  sequences  for  computing  GCD’s  and  result- 
ants. There  are  still  many  unanswered  questions  regarding 
PRS  algorithms.  For  instance,  is  the  Subresultant  PRS  GCD 
Algorithm  the  best  one  of  its  kind?  Section  3.7  of  [BR071J 
answers  this  negatively,  but  some  theoretical  and  algori- 
thmic details  have  yet  to  be  worked  out.  A non-trivial 
improvement  to  the  Subresultant  PRS  GCD  Algorithm  uas  dis- 
covered and  presented  by  Hearn  in  1972  [HEA72] . It  has  been 
shown  by  the  EZGCD  Algorithm  that  the  Hensel-type  construc- 
tion works  quite  uel I in  GCD  computations.  Can  a similar 
Hense I -re  I a ted  method  be  used  for  the  computation  of  poly- 
nomial resultants? 

(G)  As  a matter  of  habit  from  algorithmic  analysis  in 
complexity  theory,  ue  have  been  taking  the  cost  of  polyno- 


mial  (versus  integer,  in  complexity  theory)  multiplication 
and  division  as  the  basic  operations  to  be  accounted  for  in 
algebraic  algorithmic  analysis.  Our  more  intuitive  method 
of  analysis  has  hopefully  shoun  that  complex  polynomial 
(especially  multivariate)  operations  often  involve  many 
other  equally  important  basic  operations  which  also  contri- 
bute significantly  to  the  total  computing  cost.  In  the  case 
of  the  Henset  construction  (GHA)  and  the  EZGCD  Algorithm,  we 
were  able  to  pin  point  one  particular  parameter,  the  number 
of  terms,  as  the  major  variable  on  which  the  total  computing 
cost  depends.  Can  one  find  such  a parameter  for  other 
polynomial  algorithms?  The  uork  that  remains  to  be  done  in 
this  respect  is  to  establish  some  order  in  this  complex 
environment  by  finding  the  relative  cost  differences  among 
several  important  basic  operations.  For  example,  the  cost 
of  forming  the  product  of  two  polynomial  terms  in  relation 
to  comparing  two  terms  and  determining  their  relative  posi- 
tions  in  some  canonical  ordering  of  polynomial  terms.  Once 
such  a ranking  of  cost  units  for  major  relevant  basic  oper- 
ations is  obtained,  it  will  be  much  easier  to  perform  accu- 
rate computing  time  analyses  and  to  pin  point  important 
cost-affecting  parameters.  The  ultimate  goal  for  timing 
analysis  should  be  predictive  accuracy  for  computational 
algorithms  with  realistic  problems. 

(7)  A very  simple  but  generally  useful  substitution 


operation  is  necessary  for  the  Hensel  construction.  Ue 
found  out  that  the  more  straight  forward  method  of  substi- 
tuting term  by  term  is  actually  faster  than  that  by  Horner’s 
rule,  contrary  to  the  general  impression  due  to  the  effi- 
ciency of  the  Horner’s  rule  in  polynomial  evaluations 
[H0R731 . Thus,  further  understanding  of  and  a better  method 
for  substitution  of  polynomials  into  polynomials  still  seems 
to  be  in  demand.  Related  to  the  substitution  problem  and, 
of  course,  to  the  Hensel  construction  as  well  as  the  EZGCD 
Algorithm,  is  the  open  problem  of  blowup  due  to  non-zero 
substitutions  mentioned  again  in  the  last  section.  If  this 
problem  can  be  successfully  avoided,  then  the  class  of 
problems  uhere  the  EZGCD  performs  well  can  be  greatly 
enlarged. 

(8)  The  use  of  the  Hensel  construction  in  polynomial 
factorization  and  GCO  computations  theoretically  requires  a 
bound  on  the  integral  coefficients  of  the  factors  or  divi- 
sors of  the  given  polynomials.  Although  the  computational 
algorithms  easily  and  successfully  get  around  this  need  by 
calculating  only  a heuristic  bound  supported  by  some  final 
tests  for  the  sufficiency  of  this  bound,  it  is  still  highly 
desirable  to  be  able  to  compute  a good  true  bound  at  the 
start.  Knuth  [KNU69]  gives  several  good  bounds  for  the 
factors  of  univariate  polynomials.  But  for  multivariate 
polynomials,  his  methods  are  either  not  general izab I e or  too 
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time  consuming.  Other  quickly  computed  absolute  bounds  for 
multivariate  polynomial  factors,  such  as  the  one  given  by 
flusser  Cf1US731 , are  unrealistically  large.  So  we  desire  a 
quickly  computable  tight  bound  for  the  integral  coefficients 
of  factors  of  arbitrarily  given  multivariate  polynomials. 

(9)  Now  that  the  Hensel  construction  makes  factori- 
zations for  polynomials  a computationally  feasible  task,  It 
appears  reasonable  to  expect  to  encounter  more  and  more 
factored  polynomials  in  symbolic  computations  in  the  future. 
Factored  polynomials  are  in  general  smaller,  more  compre- 
hensible, and  give  more  insight.  However,  we  have  been  so 
accustomed  to  expanded  representations  of  polynomials  that 
little  is  knoun  about  how  to  perform  such  a basic  operation 
as  division  efficiently  when  either  or  both  polynomials  are 


in  factored  forms.  Uork  on  this  aspect  will  definitely  be 


a 


APPENDIX:  BASIC  ALGEBRAIC  CONCEPTS  AND  ESSENTIAL  NOTATIONS 


In  this  section,  we  ui  1 1 review  some  basic  concepts  in 


modern  algebra  EHER64]  IB4M65] , which  will  be  used  freely 


throughout  this  thesis.  Ue  also  establish  some  notations 


uhich  will  be  essential  to  facilitate  our  expositions. 


In  an  algebraic  ring,  uni ts  are  simply  divisors  of 


unity  or  elements  having  inverses  in  the  domain,  zero-d I vi- 


sors are  divisors  of  zero.  A commutative  ring  is  an  in te- 


ar a I domain  if  it  has  no  zero-divisors.  In  a field,  all 


non-zero  elements  are  unite;  in  the  domain  of  integers, 


denoted  by  Z,  the  only  units  are  1 and  -1.  Elements  of  an 


integral  domain  that  divide  each  other  are  called  assoc  1- 


A binary  relation  is  called  an  equivalence  relation  if 


it  satisfies  reflexive,  symmetric,  and  transitive  proper 


ties.  Equivalence  relations  divide  a domain  into  eau i va- 


I ence 


and  it  is  often  convenient  to  choose  one 


simple  element  out  of  each  equivalence  class  as  a canonical 


jresentat i ve  and  define  it  to  be  uni t normal . The 


relation  of  associativity  is  an  equivalence  relation  and 


decomposes  the  integral  domain  into  associate  classes. 


For  a given  element  q in  an  integral  domain  I,  the 


relation 


modu I o g,  defined  by  a,b  in  I is 


a-b  (mod  q)  iff  (if  and  only  if)  q divides  a-b,  is  an 


E 
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equivalence  relation.  Ue  uiil  let  (Z/q)  denote  the  set  of 
equivalence  classes  with  respect  to  the  congruence  relation 
modulo  9ome  positive  integer  q.  Z/Z  will  denote  the  field 
of  rational  numbers. 

An  element  of  an  integral  domain  Is  said  to  be  irrsdu- 
i Mile  if  its  only  divisors  are  units  and  associates.  An 
integral  domain  in  which  every  element  can  be  represented 
uniquely  (up  to  associativity)  as  a product  of  irreduciblee 
i 3 called  a unique  factorization  domain  (u. f.d.). 

Let  J be  a unique  factorization  domain,  then  JIx]  Is 
also  a u.f.d.  (HER64)  uhose  units  are  units  of  J.  In  Jlx], 
ue  assume  that  the  terms  of  a polynomial  P are  arranged  in 
the  order  of  decreasing  exponents, 
n n-1 

P-cx  +c  X +...+CX+C. 

n n-1  1 0 

The  coefficient  of  the  highest  non-zero  power  of  x is  called 
the  leading  coefficient.  I c (P) , and  the  coefficient  of  the 
louest  ncn-negative  pouer  of  x i9  called  the  trai I ino  coe- 
fficient. tc!P).  A polynomial  P is  called  mon i c if 
I c (P)  * 1.  A polynomial  i9  considered  posi five  if  its 
leading  integral  coefficient  ( leading  coefficient  of  each 
variable  taken  rt  ursively)  is  positive. 

Final  ly  ue  define  sauare-f’-ee  decomposition  of  a poly- 
nomial in  J[x)  and  oar  t i a I fraction  decomposi tion  for  a 


, Lai-.-' ^ ..  - . .i ; ■ i A 
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rational  function  where  the  numerator  and  the  denominator  are 
both  polynomials  in  JIxl.  A polynomial  in  J[x]  is  called 
square-free  if  all  of  its  non-trivial  factors  (i.e.  factors 
of  positive  degree  in  x)  occur  not  more  than  r~ce..  The 
square-free  decomoosi  t ion  of  a polynomial  Q(x)  in  J[x] 

k 

is  a product  of  the  form  cj'j*" where  C Is  in  J, 

i-1 


each  Q is  primitive  and  positive  in  J[x],  and  the  Q’s  are 
i i 


pairuise  relatively  prime.  A rational  function  over  J[x]  is 
a numerator-denominator  pair  of  polynomials  in  J[x], 
P(x)/Q(x).  Each  such  rational  function  has  a canonical lu 
reduced  form  uhere  P and  Q are  relatively  prime  and  Q is 
positive.  Tuo  rational  functions  are  said  to  be  canonical lu 
egua I if  their  canonically  reduced  forms  are  identically 
equal.  A rational  function  P(x)/Q(x)  is  called  proper  if 
degiP)  < deg(Q)  and  improper  otherwise.  For  a canonically 
reduced  proper  rational  function  P tx) /□ Cx) , a souare-free 
Partial  fraction  (s.f.p.f.)  decomposition  of  P/Q  is 
k 


E 


P (x)/(C  Q Cx) ) uhere  Q has  a square  free  decomposition 
i i i 


i *1 


■ - - A . i'f  ifriT -y  . 


TTv 


i 

P is  in  Jtx],  C is  in  J,  and  deg(P  ) < deg(Q  ) 


for  all  1 < i < k.  By  successive  application  of  pseudo- 
division in  J[xl , each  P / (C  Q ) can  be  further  decomposed  In 


the  form 


\ AP  (x)/(C  Q (x) ) where  deg(P  ) < deg(Q  ) 
■ • J i • j i i , j i 


for  all  1 < j < i.  If  each  term  of  s.f.p.f.  decomposi- 
tion of  P/Q  is  so  expressed  such  that 


k i 


i’l  j»l 


P (x)/(C  Q (x)) 
' . j i t j i 


then  ue  have  the  ggm&lflig  MUars-fr.ee  Partial  fraction 
(c.r.f.p.f.)  decomposition  of  P/Q. 


■ 


■imjz  jw 
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